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(57) Abstract: The present invention provides prognostic 
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for constructing such patient subsets and of identifying 
prognosis-informative genesets for such subsets. The 
invention further provides methods of assigning a 
therapeutic regimen to an individual, microarrays useful for 
performing prognosis, kits comprising these microarrays, 
and computer systems and programs for implementing the 
methods of the invention. 
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CLASSIFICATION OF BREAST CANCER PATIENTS USING A 
COMBINATION OF CLINICAL CRITERIA AND INFORMATIVE GENESETS 



[0001] This application claims the benefit under 35 U.S.C. § 1 19(e) of U.S. Provisional 
Patent Application No. 60/650,401, filed on February 4, 2005, U.S. Provisional Patent 
Application No. 60/604,076, filed on August 24, 2004, and U.S. Provisional Patent 
Application No. 60/550,810, filed on March 5, 2004, each of which is incorporated by 
reference herein in its entirety. 

1 . FIELD OF THE INVENTION 

[0002] The present invention relates to the use of both phenotypic and genotypic aspects of 
a condition, such as a disease, in order to identify discrete subsets of patients for which 
specific sets of informative genes are then identified. The invention also relates to the 
classification of individuals, such as breast cancer patients, into a subset of the condition on 
the basis of clinical parameters and the status of markers, for example, of genes expression 
patterns, and the prognosis of those individuals on the basis of markers informative for 
prognosis within the subset of the condition. The invention also relates to methods of 
determining a course of treatment or therapy to an individual having, or suspected of having, 
a condition, such as breast cancer. The invention further relates to methods of structuring a 
clinical trial, particularly using five breast cancer-specific patient subsets and prognosis- 
informative genes for each, and of identifying patient populations for clinical trials or for 
other condition-related, for example, breast cancer-related, research. Finally, the invention 
relates to computer implementations of the above methods. 

2. BACKGROUND OF THE INVENTION 

[0003] The increased number of cancer cases reported in the United States, and, indeed, 
around the world, is a major concern. Currently there are only a handful of treatments 
available for specific types of cancer, and these provide no guarantee of success. In order to 
be most effective, these treatments require not only an early detection of the malignancy, but 
a reliable assessment of the severity of the malignancy. 

[0004] The incidence of breast cancer, a leading cause of death in women, has been 
gradually increasing in the United States over the last thirty years. Its cumulative risk is 
relatively high; 1 in 8 women are expected to develop some type of breast cancer by age 85 in 
the United States. In fact, breast cancer is the most common cancer in women and the second 
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most common cause of cancer death in the United States. In 1997, it was estimated that 
181,000 new cases were reported in the U.S., and that 44,000 people would die of breast 
cancer (Parker et al, CA Cancer J. Clin. 41:5-21 (1997); Chu et al, J. Nat Cancer Inst. 
88:1571-1579 (1996)). While mechanism of tumorigenesis for most breast carcinomas is 
largely unknown, there are genetic factors that can predispose some women to developing 
breast cancer (Miki et al, Science, 266:66-71(1994)). 

[0005] Sporadic tumors, those not currently associated with a known germline mutation, 
constitute the majority of breast cancers. It is also likely that other, non-genetic factors also 
have a significant effect on the etiology of the disease. Regardless of the cancer's origin, 
breast cancer morbidity and mortality increases significantly if it is not detected early in its 
progression. Thus, considerable effort has focused on the early detection of cellular 
transformation and tumor formation in breast tissue. 

[0006] A marker-based approach to tumor identification and characterization promises 
improved diagnostic and prognostic reliability. Typically, the diagnosis of breast cancer 
requires histopathological proof of the presence of the tumor. In addition to diagnosis, 
histopathological examinations also provide information about prognosis and selection of 
treatment regimens. Prognosis may also be established based upon clinical parameters such 
as tumor size, tumor grade, the age of the patient, and lymph node metastasis. 
[0007] Diagnosis and/or prognosis may be determined to varying degrees of effectiveness 
by direct examination of the outside of the breast, or through mammography or other X-ray 
imaging methods (Jatoi, Am. J. Surg. 177:518-524 (1999)). The latter approach is not 
without considerable cost, however. Every time a mammogram is taken, the patient incurs a 
small risk of having a breast tumor induced by the ionizing properties of the radiation used 
during the test. In addition, the process is expensive and the subjective interpretations of a 
technician can lead to imprecision. For example, one study showed major clinical 
disagreements for about one-third of a set of mammograms that were interpreted individually 
by a surveyed group of radiologists. Moreover, many women find that undergoing a 
mammogram is a painful experience. Accordingly, the National Cancer Institute has not 
recommended mammograms for women under fifty years of age, since this group is not as 
likely to develop breast cancers as are older women. It is compelling to note, however, that 
while only about 22% of breast cancers occur in women under fifty, data suggests that breast 
cancer is more aggressive in pre-menopausal women. 

[0008] In clinical practice, accurate diagnosis of various subtypes of breast cancer is 
important because treatment options, prognosis, and the likelihood of therapeutic response all 
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vary broadly depending on the diagnosis. Accurate prognosis, or determination of distant 
metastasis-free survival could allow the oncologist to tailor the administration of adjuvant 
chemotherapy, with women having poorer prognoses being given the most aggressive 
treatment. Furthermore, accurate prediction of poor prognosis would greatly impact clinical 
trials for new breast cancer therapies, because potential study patients could then be stratified 
according to prognosis. Trials could then be limited to patients having poor prognosis, in 
turn making it easier to discern if an experimental therapy is efficacious. 
[0009] To date, no set of satisfactory predictors for prognosis based on the clinical 
information alone has been identified. Many have observed that the ER status has a dominant 
signature in the breast tumor gene expression profiling. See West et aL, Proc. Natl Acad. 
Set U.S.A. 98:1 1462 (2001); van 't Veer et aL, Nature 415:530 (2002); Sorlie et aL, Proc. 
Natl. Acad. Set U.S.A. 100:8418 (2003); Perou et aL, Nature 406:141 (2000); Gruvberger et 
aL, Cancer Res. 61:5979 (2001); Sotiriou et aL, Proc. Natl. Acad. Sci. U.S.A. 100:10393 
(2003). It is generally accepted that there is some relationship between patient survival and 
ER status, van de Vijver et aL, N Engl. J. Med. 347:1999 (2002); Surowiak et al, Folia 
Histochem. CytobioL 39:143 (2001); Pichon et aL, Br. J. Cancer 73:1545 (1996); Collett et 
aL, J. Clin. Pathol. 49:920 (1996). BRCA1 mutations are related to the familial cancer 
susceptibility. Biesecker et aL, JAMA 269:1970 (1993); Easton et aL, Cancer Surv. 18:95 
(1993). Age is also considered to be a prognosis factor since young cancer patients tend to 
have poor tumors. Maggard et al, J. Surg. Res. 113:109(2003). Lymph node status is a 
factor in deciding the treatment. Eifel et aL, J. Natl. Cancer Inst 93:979 (2001). 
[0010] The discovery and characterization of BRCA1 and BRCA2 has recently expanded 
our knowledge of genetic factors which can contribute to familial breast cancer. Germ-line 
mutations within these two loci are associated with a 50 to 85% lifetime risk of breast and/or 
ovarian cancer (Casey, Curr. Opin. Oncol. 9:88-93 (1997); Marcus et aL, Cancer 11:691- 
709 (1996)). Only about 5% to 10% of breast cancers, however, are associated with breast 
cancer susceptibility genes, BRCA1 and BRCA2. The cumulative lifetime risk of breast 
cancer for women who carry the mutant BRCA1 is predicted to be approximately 92%, while 
the cumulative lifetime risk for the non-carrier majority is estimated to be approximately 
10%. BRCA1 is a tumor suppressor gene that is involved in DNA repair and cell cycle 
control, which are both important for the maintenance of genomic stability. More than 90% 
of all mutations reported so far result in a premature truncation of the protein product with 
abnormal or abolished function. The histology of breast cancer in BRCA1 mutation carriers 
differs from that in sporadic cases, but mutation analysis is the only way to find the carrier. 
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Like BRCA1, BRCA2 is involved in the development of breast cancer, and like BRCA1 plays 
a role in DNA repair. However, unlike BRCA1, it is not involved in ovarian cancer. 
[001 1] Other genes have been linked to breast cancer, for example c-erb-2 (HER2) and p53 
(Beenken et al, Ann. Surg. 233(5):630-638 (2001). Overexpression of c-erb-2 (HER2) and 
p53 have been correlated with poor prognosis (Rudolph et al, Hum. Pathol 32(3):31 1-319 
(2001), as has been aberrant expression products of mdm2 (Lukas et al, Cancer Res. 
61(7):3212-3219 (2001) and cyclinl andp27 (Porter & Roberts, International Publication 
WO98/33450, published August 6, 1998). 

[0012] The detection of BRCA1 or BRCA2 mutations represents a step towards the design of 
therapies to better control and prevent the appearance of these tumors. Recently, many 
studies have used gene expression profiling to analyze various cancers, and those studies 
have provided new diagnosis and prognosis information in the molecular level. See 
Zajchowski et al, "Identification of Gene Expression Profiled that Predict the Aggressive 
Behavior of Breast Cancer Cells," Cancer Res. 61:5168 (2001); West et al, "Predicting the 
Clinical Status of Human Breast Cancer by Using Gene Expression Profiles," Proc. Natl 
Acad. Scl U.S.A. 98:1 1462 (2001); van 't Veer et al, "Gene Expression Profiling Predicts the 
Outcome of Breast Cancer," Nature 415:530 (2002); Roberts et al, "Diagnosis and Prognosis 
of Breast Cancer Patients," WO 02/103320; Sorlie et al, Proc. Natl Acad. Scl U.S.A. 
100:8418 (2003); Perou et al, Nature 406:747 (2000); Khan et al, Cancer Res 58, 5009 
(1998); Golub et al, Science 286, 531 (1999); DeRisi et al, Nat. Genet. 14:457 (1996); 
Alizadeh et al, Nature 403, 503 (2000). Methods for the identification of informative 
genesets for various cancers have also been described. See Roberts et al, "Diagnosis and 
Prognosis of Breast Cancer Patients," WO 02/103320; Golub et al, United States Patent No. 
6,647,341. 

[0013] Genesets have been identified that are informative for differentiating individuals 
having, or suspected of having, breast cancer based on estrogen receptor (ER) status, or 
BRCA1 mutation vs. sporadic {i.e., other than BRCAl-type) mutation status. See Roberts et 
al, WO 02/103320; van't Veer et al, Nature 415:530 (2001). Genesets have also been 
identified that enable the classification of sporadic tumor-type individuals as those who will 
likely have no metastases within five years of initial diagnosis {i.e., individuals with a good 
prognosis) or those who will likely have a metastasis within five years of initial diagnosis 
{i.e., those having a poor prognosis). Roberts, supra; van't Veer, supra. 
[0014] Roberts et al WO 02/103320 describes a 70-gene set, useful for the prognosis of 
breast cancer, which outperformed clinical measures of prognosis, and which showed good 
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potential in selecting good outcome patients, thereby avoiding over-treatment, van de Vijver 
et al 9 N. Engl J, Med. 347:1999 (2002). The expression of genes with most predictive value, 
however, were not homogeneous among poor patients, suggesting the need for improvement. 
[0015] Although the patterns of gene expression as described in Roberts et ah were 
correlated with existing clinical indicators such as estrogen receptor and BRCA1 status, 
clinical measures were not incorporated. Furthermore, although the poor-outcome group in 
particular showed heterogeneity in expression pattern, the best classifier decision rule found 
during these studies was a fairly simple one based on the similarity of a patient profile to the 
average profile of a good-outcome training group. 

[0016] It is evident that breast cancer is the result of more than one type of molecular event. 
Likewise, a variety of other conditions, such as other cancers; non-cancer diseases such as 
diabetes, autoimmune or neurodegenerative disorders, obesity; etc., are also the result of 
more than one molecular event. Moreover, an individual's response to exposure to particular 
environmental conditions, for example, exposure to natural or man-made agents, such as 
toxins, pollutants, drugs, food additives, etc., likely result from more than one molecular 
event. Thus, there exists a need for improved prognostic methods so that appropriate courses 
of prophylaxis and/or therapy may be provided. Genesets having improved prognostic power 
can be identified by first identifying discrete subsets of individuals based on genotypic or 
phenotypic characteristics relevant to the disease or condition, and then identifying genesets 
informative for prognosis within those subsets of patients. Individuals having the condition, 
or who are suspected of having the condition, such as breast cancer, would then be provided 
therapies appropriate to the molecular mechanisms underlying the condition. The present 
invention provides such methods for breast cancer, and for other cancers, diseases or 
conditions. 

3 . SUMMARY OF THE INVENTION 

[0017] The present invention provides methods of identifying relevant subsets of 
conditions, and the identification of markers relevant to those subsets, for example, for 
prognosis of individuals classifiable into one of those subsets. The invention further provides 
sets of markers useful for the prognosis of individuals having breast cancer, wherein those 
patients have been classified according to one or more characteristics of breast cancer. 
[001 8] Thus, the present invention provides a method of identifying a set of informative 
genes or markers for a condition comprising a plurality of phenotypic or genotypic 
characteristics, comprising: (a) classifying each of a plurality of samples or individuals on the 
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basis of one or more phenotypic or genotypic characteristics of said condition into a plurality 
of first classes; and (b) identifying within each of said first classes a first set of genes or 
markers informative for said condition, wherein said first set of genes or markers within each 
of said first classes is unique to said class relative to other first classes. In a specific 
embodiment, this method further comprises additionally classifying into a plurality of second 
classes said samples or individuals in at least one of said first classes on the basis of a 
phenotypic or genotypic characteristic different that that used in said classifying step (a); and 
identifying within at least one of said second classes a second set of informative genes or 
markers, wherein said second set of informative genes or markers within each of said second 
classes is unique to said second class relative to other first and second classes. 
[00 1 9] The invention further provides a method of identifying a set of informative genes or 
markers for a condition comprising a plurality of phenotypic or genotypic characteristics, 
comprising: (a) classifying each of a plurality of samples or individuals on the basis of one or 
more phenotypic or genotypic characteristics into a plurality of first classes; (b) classifying at 
least one of said first classes into a plurality of second classes on the basis of phenotypic or 
genotypic characteristic different than that used in said classifying step (a); and (c) 
identifying within at least one of said first classes or said second classes a set of genes or 
markers informative for said condition, wherein said second set of genes or markers is unique 
to said class relative to other first and second classes. 

[0020] The invention further provides a method of identifying a set of informative genes or 
markers for a condition comprising a plurality of phenotypic or genotypic characteristics, 
comprising: (a) selecting a first characteristic from said plurality of phenotypic or genotypic 
characteristics; (b) identifying at least two first condition classes differentiable by said first 
characteristic; (c) selecting a plurality of individuals classifiable into at least one of said first 
condition classes; and (d) identifying in samples derived from each of said plurality of 
individuals a set of genes or markers informative for said condition within said at least one of 
said first condition classes. 

[002 1] The invention further provides a method of classifying an individual with a 
condition as having a good prognosis or a poor prognosis, comprising: (a) classifying said 
individual into one of a plurality of patient classes, said patient classes being differentiated by 
one or more phenotypic, genotypic or clinical characteristics of said condition; (b) 
determining the level of expression of a plurality of genes or their encoded proteins in a cell 
sample taken from the individual relative to a control, said plurality of genes or their encoded 
proteins comprising genes or their encoded proteins informative for prognosis of the patient 
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class into which said individual is classified; and (c) classifying said individual as having a 
good prognosis or a poor prognosis on the basis of said level of expression. In a specific 
embodiment, said condition is cancer, said good prognosis is the non-occurrence of 
metastases within five years of initial diagnosis, and said poor prognosis is the occurrence of 
metastases within five years of initial diagnosis. In a more specific embodiment, said cancer 
is breast cancer. In another specific embodiment, said control is the average level of 
expression of each of said plurality of genes or their encoded proteins across a plurality of 
samples derived from individuals identified as having a poor prognosis. In a more specific 
embodiment, said classifying step (c) is carried out by a method comprising comparing the 
level of expression of each of said plurality of genes or their encoded proteins to said average 
level of expression of each corresponding gene or its encoded protein in said control, and 
classifying said individual as having a poor prognosis if said level of expression correlates 
with said average level of expression of each of said genes or their encoded proteins in said 
control more strongly than would be expected by chance. In another specific embodiment, 
said control is the average level of expression of each of said plurality of genes or their 
encoded proteins across a plurality of samples derived from individuals identified as having a 
good prognosis. In a more specific embodiment, said classifying in step (c) is carried out by 
a method comprising comparing the level expression of each of said plurality of genes or 
their encoded proteins to said average level of expression of each corresponding gene or its 
encoded protein in said control, and classifying said individual as having a good prognosis if 
said level of expression correlates with said average level of expression of each of said genes 
or their encoded proteins in said control more strongly than would be expected by chance. In 
another specific embodiment, said plurality of patient classes comprises ERT~, BRCAI 
individuals; ER~, sporadic individuals; ER+, ER/AGE high individuals; ER+, ER/AGE low, 
LN+ individuals; and ER+, ER/AGE low, LN" individuals. 

[0022] The invention further provides a method of classifying a breast cancer patient as 
having a good prognosis or a poor prognosis comprising: (a) classifying said breast cancer 
patient as ER", BRCAI; ER~ sporadic; ER+, ER/AGE high; ER+, ER/AGE low, LN+; or 
ER+, ER/AGE low, LN~; (b) determining the level of expression of a first plurality of genes 
in a cell sample taken from said breast cancer patient relative to a control, said first plurality 
of genes comprising two of the genes corresponding to the markers in Table 1 if said breast 
cancer patient is classified as ER~ BRCAI; in Table 2 if said breast cancer patient is 
classified as ER"" sporadic; in Table 3 if said breast cancer patient is classified as ER+, 
ER/AGE high; in Table 4 if said breast cancer patient is classified as ER+. ER/AGE low, 
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LN+; or in Table 5 if said breast cancer patient is classified as ER+, ER/AGE low, LINT; and 
(c) classifying said breast cancer patient as having a good prognosis or a poor prognosis on 
the basis of the level of expression of said first plurality of genes, wherein said breast cancer 
patient is "ER/AGE high" if the ratio of the logi 0 (ratio) of ER gene expression to age exceeds 
a predetermined value, and "ER/AGE low" if the ratio of the logi 0 (ratio) of ER gene 
expression to age does not exceed said predetermined value. In a specific embodiment, said 
control is the average level of expression of each of said plurality of genes in a plurality of 
samples derived from ER~, BRCA1 individuals, if said breast cancer patient is ER~, BRCA1; 
the average level of expression of each of said plurality of genes in a plurality of samples 
derived from ER", sporadic individuals if said breast cancer patient is ER", sporadic; the 
average level of expression of each of said plurality of genes in a plurality of samples derived 
from ER+, ER/AGE high individuals, if said breast cancer patient is ER+, ER/AGE high; the 
average level of expression of each of said plurality of genes in a plurality of samples derived 
from ER+, ER/AGE low, LN+ individuals where said breast cancer patient is ER+, ER/AGE 
low, LN+; or the average level of expression of each of said plurality of genes in a plurality 
of samples derived from ER+, ER/AGE low, LN~ individuals where said breast cancer patient 
is ER+, ER/AGE low, LIST. In a more specific embodiment, each of said individuals has a 
poor prognosis. In another more specific embodiment, each of said individuals has a good 
prognosis. In an even more specific embodiment, said classifying step (c) is carried out by a 
method comprising comparing the level of expression of each of said plurality of genes or 
their encoded proteins in a sample from said breast cancer patient to said control, and 
classifying said breast cancer patient as having a poor prognosis if said level of expression 
correlates with said average level of expression of the corresponding genes or their encoded 
proteins in said control more strongly than would be expected by chance. In another specific 
embodiment, said predetermined value of ER is calculated as ER = 0.1 (AGE - 42.5), wherein 
AGE is the age of said individual. In another specific embodiment, said individual is ER", 
BRCA1 , and said plurality of genes comprises two of the genes for which markers are listed 
in Table 1. In another specific embodiment, said individual is ER", BRCA1, and said 
plurality of genes comprises all of the genes for which markers are listed in Table 1 . In 
another specific embodiment, said individual is ER", sporadic, and said plurality of genes 
comprises two of the genes for which markers are listed in Table 2. said individual is ER", 
sporadic, and said plurality of genes comprises all of the genes for which markers are listed in 
Table 2. In another specific embodiment, said individual is ER+, ER/AGE high, and said 
plurality of genes comprises two of the genes for which markers are listed in Table 3. said 
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individual is ER+, ER/AGE high, and said plurality of genes comprises all of the genes for 
which markers are listed in Table 3. In another specific embodiment, said individual is ER+, 
ER/AGE low, LN+, and said plurality of genes comprises two of the genes for which markers 
are listed in Table 4. In another specific embodiment, said individual is ER+, ER/AGE low, 
LN+, and said plurality of genes comprises all of the genes for which markers are listed in 
Table 4. In another specific embodiment, said individual is ER+, ER/AGE low, LIST, and 
said plurality of genes comprises two of the genes for which markers are listed in Table 4. In 
another specific embodiment, said individual is ER+, ER/AGE low, LN~, and said plurality of 
genes comprises all of the genes for which markers are listed in Table 4. In another specific 
embodiment, the method further comprises determining in said cell sample the level of 
expression, relative to a control, of a second plurality of genes for which markers are not 
found in Tables 1-5, wherein said second plurality of genes is informative for prognosis. 
[0023] In another embodiment, the invention provides a method for assigning an individual 
to one of a plurality of categories in a clinical trial, comprising: (a) classifying said individual 
as ER-, BRCA1, ER~ sporadic; ER+, ER/AGE high; ER-f-, ER/AGE low, LN+; or ER+, 
ER/AGE low, LIST; (b) determining for said individual the level of expression of at least two 
genes for which markers are listed in Table 1 if said individual is classified as ER", BRCA1; 
Table 2 if said individual is classified as ER", sporadic; Table 3 if said individual is classified 
as ER+, ER/AGE high; Table 4 if said individual is classified as ER+, ER/AGE low, LN+; or 
Table 5 if said individual is classified as ER+, ER/AGE low, LN"; (c) determining whether 
said individual has a pattern of expression of said at least two genes that correlates with a 
good prognosis or a poor prognosis; and (d) assigning said individual to one category in a 
clinical trial if said individual has a good prognosis, and assigning said individual to a second 
category in said clinical trial if said individual has a poor prognosis. In a specific 
embodiment, said individual is additionally assigned to a category in said clinical trial on the 
basis of the classification of said individual as determined in step (a). In another specific 
embodiment, said individual is additionally assigned to a category in said clinical trial on the 
basis of any other clinical, phenotypic or genotypic characteristic of breast cancer. In another 
specific embodiment, said method further comprises determining in said cell sample the level 
of expression, relative to a control, of a second plurality of genes for which markers are not 
found in Tables 1-5, wherein said second plurality of genes is informative for prognosis of 
breast cancer, and determining from the expression of said second plurality of genes, in 
addition to said first plurality of genes, whether said individual has a good prognosis or a 
poor prognosis. 
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[0024] The invention further provides a microarray comprising probes complementary and 
hybridizable to a plurality of the genes for which markers are listed in any of Tables 1-5. The 
invention further provides a microarray comprising probes complementary and hybridizable 
to a plurality of the genes for which markers are listed in Table 1, each of the genes for which 
markers are listed in Table 1, a plurality of the genes for which markers are listed in Table 2, 
each of the genes for which markers are listed in Table 2, a plurality of the genes for which 
markers are listed in Table 3, each of the genes for which markers are listed in Table 3, a 
plurality of the genes for which markers are listed in Table 4, each of the genes for which 
markers are listed in Table 4, a plurality of the genes for which markers are listed in Table 5, 
or each of the genes for which markers are listed in Table 5. The invention further provides 
any one of the above microarrays, wherein said probes are at least 50% of the probes on said 
microarray. The invention further provides any one of the above microarrays, wherein said 
probes are at least 90% of the probes on said microarray. The invention further provides 
microarray comprising probes complementary and hybridizable to a plurality of the genes for 
which markers are listed in any of Tables 1-5, wherein said probes are complementary and 
hybridizable to at least 75% of the genes for which markers are listed in Table 1; are 
complementary and hybridizable to at least 75% of the genes for which markers are listed in 
Table 2; are complementary and hybridizable to at least 75% of the genes for which markers 
are listed in Table 3; are complementary and hybridizable to at least 75% of the genes for 
which markers are listed in Table 4; and are complementary and hybridizable to at least 75% 
of the genes for which markers are listed in Table 5, wherein said probes, in total, are at least 
50% of the probes on said microarray. 

[0025] The invention further comprises a kit comprising any one of the above microarrays 
in a sealed container. 

[0026] The invention further provides a method of identifying a set of genes informative for 
a condition, said condition having a plurality of phenotypic or genotypic characteristics such 
that samples may be categorized by at least one of said phenotypic or genotypic 
characteristics into at least one characteristic class, said method comprising: (a) selecting a 
plurality of samples from individuals having said condition; (b) identifying a first set of genes 
informative for said- characteristic class using said plurality of samples; (c) predicting the 
characteristic class of each of said plurality of samples; (d) discarding samples for which said 
characteristic class is incorrectly predicted; (e) repeating steps (c) and (d) at least once; and 
(f) identifying a second set of genes informative for said characteristic class using samples in 
said plurality of samples remaining after step (e). 
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[0027] The invention further provides a method for assigning an individual to one of a 
plurality of categories in a clinical trial, comprising: (a) classifying the individual into one of 
a plurality of condition categories differentiated by at least one genotypic or phenotypic 
characteristic of the condition; (b) determining the level of expression, in a sample derived 
from said individual, of a plurality of genes informative for said condition category; (c) 
determining whether said level of expression of said plurality of genes indicates that the 
individual has a good prognosis or a poor prognosis; and (d) assigning the individual to a 
category in a clinical trial on the basis of prognosis. 

[0028] The invention also provides a method for identifying one or more sets of informative 
genes or markers for a condition in an organism, comprising: (a) subdividing a plurality of 
individuals or samples derived therefrom of the organism subject to the condition into a 
plurality of classes based on one or more clinical, phenotypic or genotypic characteristics of 
the organism, wherein each class consists of a plurality of individuals or samples derived 
therefrom of the organism each of which having one or more clinical, phenotypic or 
genotypic characteristics specific for the class; and (b) attempting to identify for each of one 
or more of said plurality of classes a set of genes or markers informative for said condition in 
individuals in said class, wherein, if a set of genes or markers informative for said condition 
in individuals in said class is obtained for any of said one or more of said plurality of classes, 
said set of genes or markers is taken as a set of informative genes or markers for said 
condition in said organism. 

[0029] In one embodiment, the method further comprises, for each of one or more of said 
classes in which a set of genes or markers informative for said condition in individuals in said 
class cannot be obtained, repeating said steps (a) and (b) on said plurality of individuals or 
samples derived therefrom in said class such that said plurality of individuals or samples 
derived therefrom in said class is subdivided into a plurality of additional classes based on 
one or more clinical, phenotypic or genotypic characteristics of said organism which are 
different from those used for defining said class, wherein for each of said plurality of 
additional classes, if a set of genes or markers informative for said condition in individuals in 
said class is obtained, said set of genes or markers is taken as a set of informative genes or 
markers for said condition in said organism. 

[0030] The invention also provides a method for identifying one or more sets of informative 
genes or markers for a condition in an organism, comprising: (a) subdividing a plurality of 
individuals or samples derived therefrom of said organism subject to said condition into a 
plurality of classes based on one or more clinical, phenotypic or genotypic characteristics of 
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said organism, wherein each said class consists of a plurality of individuals or samples 
derived therefrom of said organism each having said one or more clinical, phenotypic or 
genotypic characteristics specific for said class; (b) attempting to identify for each of one or 
more of said plurality of classes a set of genes or markers informative for said condition in 
individuals in said class, wherein if a set of genes or markers informative for said condition in 
individuals in said class is identified for any of said one or more of said classes, said set of 
genes or markers is taken as a set of informative genes or markers for a condition in said 
organism; and (c) for each of one or more of said classes in which a set of genes or markers 
informative for said condition in individuals in said class cannot be obtained, repeating said 
steps (a) and (b) on said plurality of individuals or samples derived therefrom in said class 
such that said plurality of samples or individuals in said class is subdivided into a plurality of 
additional classes based on one or more clinical, phenotypic or genotypic characteristics of 
said organism which are different from those used those used for defining said class, wherein 
for each of one or more of said plurality of additional classes, if a set of genes or markers 
informative for said condition in individuals in said class is obtained, said set of genes or 
markers is taken as a set of informative genes or markers for a condition in said organism. 
[003 1 ] In the methods of the invention, the condition can be a type of cancer. In such an 
embodiment, each of said sets of genes or markers can be informative of prognosis of 
individuals in a corresponding class. In one embodiment, the condition is breast cancer, and 
the one or more clinical, phenotypic or genotypic characteristics comprise age, ER level, 
ER/AGE, BRAC1 status, and lymph node status. 

[0032] In one embodiment, the methods of the invention further comprise generating a 
template profile comprising measurements of levels of genes or markers of the set of 
informative genes or markers for said class representative of levels of the genes or markers in 
a plurality of patients having a chosen prognosis level. 

[0033] The invention also provides a method for predicting a breast cancer patient as having 
a good prognosis or a poor prognosis, comprising: (a) classifying said breast cancer patient 
into one of the following classes: (al) ER~, BRCAI; (a2) ER~, sporadic; (a3) ER+, ER/AGE 
high; (a4) ER+, ER/AGE low, LN+; or (a5) ER+, ER/AGE low, LN~; (b) determining a 
profile comprising measurements of a plurality of genes or markers in a cell sample taken 
from said breast cancer patient, said plurality of genes markers comprising at least two of the 
genes or markers corresponding to the markers in (bl) Table 1 if said breast cancer patient is 
classified as ER - , BRCAI; (b2) Table 2 if said breast cancer patient is classified as ER~ 
sporadic; (b3) Table 3 if said breast cancer patient is classified as ER+, ER/AGE high; (b4) 
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Table 4 if said breast cancer patient is classified as ER+, ER/AGE low, LN+; or (b5) Table 5 
if said breast cancer patient is classified as ER+, ER/AGE low, LIST; and (c) classifying said 
breast cancer patient as having a good prognosis or a poor prognosis based on said profile of 
said plurality of genes or markers, wherein ER + designates a high ER level and ER" 
designates a low ER level, wherein said ER/AGE is a metric of said ER level relative to the 
age of said patient, and wherein LN + designates a greater than 0 lymph nodes status in said 
patient and LN" designates a 0 lymph nodes status in said patient. 

[0034] In one embodiment, step (c) is carried out by a method comprising comparing said 
profile to a good prognosis template and/or a poor prognosis template, and wherein said 
patient is classified as having a good prognosis if said profile has a high similarity to a good 
prognosis template or has a low similarity to a poor prognosis template or as having a poor 
prognosis if said profile has a low similarity to a good prognosis template or has a high 
similarity to a poor prognosis template. A good prognosis template comprises measurements 
of said plurality of genes or markers representative of levels of said genes or markers in a 
plurality of good outcome patients, while a poor prognosis template comprises measurements 
of said plurality of genes or markers representative of levels of said genes or markers in a 
plurality of poor outcome patients. Here a good outcome patient is a breast cancer patient 
who has non-reoccurrence of metastases within a first period of time after initial diagnosis, 
while a poor outcome patient is a patient who has reoccurrence of metastases within a second 
period of time after initial diagnosis. 

[003 5] In another embodiment, the methods for predicting the prognosis of a breast cancer 
patient further comprise determining said profile, said ER level, said LN status, and/or, said 
ER/AGE. In one embodiment, said profile is an expression profile comprising measurements 
of a plurality of transcripts in a sample derived from said patient, wherein said good 
prognosis template comprises measurements of said plurality of transcripts representative of 
expression levels of said transcripts in said plurality of good outcome patients, and wherein 
said poor prognosis template comprises measurements of said plurality of transcripts 
representative of expression levels of said transcripts in said plurality of poor outcome 
patients. 

[0036] In one embodiment, said expression profile is a differential expression profile 
comprising differential measurements of said plurality of transcripts in said sample derived 
from said patient versus measurements of said plurality of transcripts in a control sample. 
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[0037] In one embodiment, the measurement of each said transcript in said good prognosis 
template is an average of expression levels of said transcript in said plurality of good 
outcome patients. 

[0038] In one embodiment, the similarity of said expression profile to said good or poor 
prognosis template is represented by a correlation coefficient between said expression profile 
and said good or poor prognosis template, respectively, and a correlation coefficient greater 
than a correlation threshold, e.g., 0.5, indicates a high similarity and said correlation 
coefficient equal to or less than said correlation threshold indicates a low similarity. 
[0039] In another embodiment, the similarity of said expression profile to said good or poor 
prognosis template is represented by a distance between said cellular constituent profile and 
said good or poor prognosis template, respectively, and a distance less than a given value 
indicates a high similarity and said distance equal to or greater than said given value indicates 
a low similarity. 

[0040] In another embodiment, said profile comprises measurements of a plurality of 
protein species in a sample derived from said patient, wherein said good prognosis template 
comprises measurements of said plurality of protein species representative of levels of said 
protein species in said plurality of good outcome patients, and wherein said poor prognosis 
template comprises measurements of said plurality of protein species representative of levels 
of said protein species in said plurality of poor outcome patients. 

[0041] In one embodiment, said ER level is determined by measuring an expression level of 
a gene encoding said estrogen receptor, e.g., the estrogen receptor a gene, in said patient 
relative to expression level of said gene in said control sample, and said ER level is classified 
as ER + if logl0(ratio) of said expression level is greater than -0.65, and said ER level is 
classified as ER" if logl0(ratio) of said expression level is equal to or less than -0.65. 
[0042] In one embodiment, said ER/AGE is classified as high if said ER level is greater 
than c (AGE - d), and said ER/AGE is classified as low if said ER level is equal to or less 
than c ( AGE - d), wherein c is a coefficient, AGE is the age of said patient, and J is an age 
threshold. 

[0043] In a specific embodiment, said estrogen receptor level is measured by a 
polynucleotide probe that detects a transcript corresponding to the gene having accession 
number NM_000125, said control sample is a pool of breast cancer cells of different patients, 
and c = 0.1 andd = 42.5. 
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[0044] In one embodiment, said control sample is generated by pooling together cDNAs of 
said plurality of transcripts from a plurality of breast cancer patients. In another embodiment, 
said control sample is generated by pooling together synthesized cDNAs of said plurality of 
transcripts and said transcript of said gene encoding said estrogen receptor. 
[0045] In one embodiment, said individual is ERT, BRCA1, and said plurality of genes 
comprises at least two of the genes for which markers are listed in Table 1 . In one 
embodiment, said individual is ERT, BRCA1, and said plurality of genes comprises all of the 
genes for which markers are listed in Table 1. 

[0046] In another embodiment, the individual is ER~, sporadic, and said plurality of genes 
comprises at least two of the genes for which markers are listed in Table 2. In one 
embodiment, said individual is ER~, sporadic, and said plurality of genes comprises all of the 
genes for which markers are listed in Table 2. 

[0047] In still another embodiment, said individual is ER+, ER/AGE high, and said plurality 
of genes comprises at least two of the genes for which markers are listed in Table 3. In one 
embodiment, said individual is ER+, ER/AGE high, and said plurality of genes comprises all 
of the genes for which markers are listed in Table 3. 

[0048] In still another embodiment, said individual is ER+, ER/AGE low, LN+, and said 
plurality of genes comprises at least two of the genes for which markers are listed in Table 4. 
In one embodiment, said individual is ER+, ER/AGE low, LN+, and said plurality of genes 
comprises all of the genes for which markers are listed in Table 4. 

[0049] In still another embodiment, said individual is ER+, ER/AGE low, LIST, and said 
plurality of genes comprises at least two of the genes for which markers are listed in Table 4. 
In one embodiment, the individual is ER+, ER/AGE low, LN~, and said plurality of genes 
comprises all of the genes for which markers are listed in Table 4. 
[0050] In one embodiment, said profile further comprises one or more genes for which 
markers are not found in Tables 1-5, which are informative for prognosis. 
[005 1] The invention also provides a method for assigning an individual to one of a 
plurality of categories in a clinical trial, comprising assigning said individual to one category 
in a clinical trial if said individual has a good prognosis as determined by any one of the 
methods described above, and assigning said individual to a second category in said clinical 
trial if said individual has a poor prognosis as determined by any one of the methods 
described above. 



15 



WO 2005/086891 



PCT/US2005/007894 



[0052] In one embodiment, said individual is additionally assigned to a category in said 
clinical trial on the basis of the classification of said individual based on said profile, said ER 
level, said LN status, and/or, said ER/AGE. 

[0053] In one embodiment, said individual is additionally assigned to a category in said 
clinical trial on the basis of one or more other clinical, phenotypic or genotypic characteristic 
of breast cancer. 

[0054] In one embodiment, the method further comprises determining in said cell sample 
the levels of expression of said one or more genes for which markers are not found in Tables 
1-5, and determining from said expression levels of said one or more genes, whether said 
individual has a good prognosis or a poor prognosis. 

4. BRIEF DESCRIPTION OF THE DRAWINGS 

[0055] FIG. 1 depicts the decision tree that resulted in the five patient subsets used to 
identify informative prognosis-related genes. 

[0056] FIG. 2: Relationship between ER level and age. (A) Scatter plot of ER vs. age for 
ER+ patients. Black dots indicate metastases free samples, and gray dots indicate metastases 
samples. It appears that patients of ER+ group can be subdivided into "ER+, ER/AGE high" 
group (above the black line) and "ER+, ER/AGE low" (below the black line) group. The 
black line is approximated by ER = 0. 1 *(AGE-42.5), and the dashed line by ER = 0. 1 *(age- 
50). Within each population, the ER level also increases with age. (B) Age distribution of all 
patients in ER+ samples. A bimodal distribution is observed. (C) ER-modulated age (age - 
10*) distribution of all patients in ER+ samples. A bimodal distribution is observed. (D) 
Age distribution of samples with metastasis. (E) ER-modulated age distribution of samples 
with metastasis. The three peaks appearing in this distribution suggest a polymorphism. 
[0057] FIG. 3. Performance of classifier for the "ERVsporadic" group. (A) Error rate 
obtained from leave-one-out cross validation (LOOCV) for predicting the disease outcome as 
a function of the number of reporter genes used in the classifier. (B) Scatter plot between 
correlation to good group (X axis) and to poor group (Y axis). Circles indicate metastases- 
free samples, squares indicate samples with metastases. Dashed line: threshold for separating 
poor from good. (C) Error rate calculated with respect to good outcome group (good 
outcome misclassified as poor divided by total number of good), or poor outcome group 
(poor outcome misclassified as good divided by total number of poor), or the average of the 
two rates. 
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[0058] FIG. 4. Performance of classifier for the "ER+, ER/AGE high" group. (A) Error 
rate obtained from leave-one-out cross validation (LOOCV) for predicting the disease 
outcome as a function of the number of reporter genes used in the classifier. (B) Scatter plot 
between correlation to good group (X axis) and to poor group (Y axis). Circles indicate 
metastases-free samples, and squares indicate samples with metastases. Dashed line: 
threshold for separating poor from good. (C) Error rate calculated with respect to good 
outcome group (good outcome misclassified as poor divided by total number of good), or 
poor outcome group (poor outcome misclassified as good divided by total number of poor), 
or the average of the two rates. 

[0059] FIG. 5. Performance of classifier for the "ER+, ER/AGE low/LN~" group. (A) 
Error rate obtained from leave-one-out cross validation (LOOCV) for predicting the disease 
outcome as a function of the number of reporter genes used in the classifier. (B) Scatter plot 
between correlation to good group (X axis) and to poor group (Y axis). Circles indicate 
metastases-free samples, and squares indicates samples with metastases. Dashed line 
indicates the threshold for separating poor from good. (C) Error rate calculated with respect 
to good outcome group (good outcome misclassified as poor divided by total number of 
good), or poor outcome group (poor outcome misclassified as good divided by total number 
of poor), or the average of the two rates. 

[0060] FIG. 6. Performance of classifier for the "ER+, ER/AGE low/LN+" group. (A) 
Error rate obtained from leave-one-out cross validation (LOOCV) for predicting the disease 
outcome as a function of the number of reporter genes used in the classifier. (B) Scatter plot 
between correlation to good group (X axis) and to poor group (Y axis). Circles indicate 
metastases free samples, squares indicate samples with metastases. Dashed line: threshold 
for separating poor from good. (C) Error rate calculated with respect to good outcome group 
(good outcome misclassified as poor divided by total number of good), or poor outcome 
group (poor outcome misclassified as good divided by total number of poor), or the average 
of the two rates. 

[0061] FIG. 7. Performance of classifier for the "ER~, BRCA1" group. (A) Error rate 
obtained from leave-one-out cross validation (LOOCV) for predicting the disease outcome as 
a function of the number of reporter genes used in the classifier. (B) Scatter plot between 
correlation to good group (X axis) and to poor group (Y axis). Circles indicate metastases 
free samples, squares indicate samples with metastases. Dashed line: threshold for separating 
poor from good. (C) Error rate calculated with respect to good outcome group (good 
outcome misclassified as poor divided by total number of good), or poor outcome group 
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(poor outcome misclassified as good divided by total number of poor), or the average of the 
two rates. 

[0062] FIG. 8. Heatmaps of genes representing key biological functions in subgroups of 
patients: A: Cell cycle genes are predictive of outcome in patients with ER/age high. B: Cell 
cycle genes are not predictive of outcome in "ER- and sporadic" patients C: Glycolysis genes 
are predictive of outcome in patients with ER/age low and LN-. D: Glycolysis genes are not 
predictive of outcome in 'ER- & BRCA1" patients. 

5. DETAILED DESCRIPTION OF THE INVENTION 

5.1 INTRODUCTION 
[0063] The present invention provides methods for classifying individuals having a 
condition, such as a disease, into one or more subsets of individuals, where individuals in 
each subset are characterized by one or more phenotypic or genotypic characteristics of the 
condition. The individuals may be eukaryotes or prokaryotes, may be animals such as 
mammals, including but not limited to humans, primates, rodents, felines, canines, etc., birds, 
reptiles, fish, etc. "Individuals" as used herein also encompasses single-celled organisms, or 
colonies thereof, such as bacteria and yeast. The condition may be a disease, such as cancer, 
and may be a specific cancer, such as breast cancer. The condition may also be an 
environmental condition, such as exposure to a toxin, pollutant, drug, proximity to urban or 
industrial areas, etc. 

[0064] The present invention provides methods of determining the prognosis of individuals 
having a condition, such as cancer, for example, breast cancer, or who are suspected of 
having the condition, by the use of a combination of clinical, biological or biochemical 
parameters of the condition and gene expression pattern data. For prognosis, the parameters 
selected preferably relate to or affect the progression and/or outcome of the condition. The 
pattern of gene expression within a subset of individuals having the particular condition leads 
to the identification of sets of genes within a subset that is informative for that subset, for 
example, for prognosis within that subset. In general, the successful identification of sets of 
genes informative for prognosis within a particular subset justifies the selection of the 
plurality of clinical, biological or biochemical parameters of the condition on which division 
of individuals into condition subsets is based. 

[0065] In the example of breast cancer, patient groups are first classified according to at 
least one of age, lymph node (LN) status, estrogen receptor (ER) level, and BRCA1 mutation 
status into discrete patient subsets. These clinical factors have been implicated in tumor 
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etiology as well as differences in disease outcome. These characteristics are not limiting; 
other genotypic or phenotypic characteristics of breast cancer, for example, tumor grade, 
tumor size, tumor cell type, etc., may also be used, alone or in combination with those listed 
herein, in order to classify individuals. The differences in gene expression or in tumor fate 
related to these parameters likely represent differences in tumor origin and tumor genesis, and 
are therefore good candidates for tumor stratification. Genesets informative for prognosis 
within each subset are then identified. New breast cancer patients are then classified using 
the same criteria, and a prognosis is made based on the geneset specific for the patient subset 
into which the patient falls. In the process of constructing a prognosis classifier within each 
patient subset, particular attention is paid to the homogeneous patterns related to the tumor 
outcome. Emergence of such homogeneous prognosis patterns may indicate the most 
common mechanism to metastasis within a subset. At the same time, successful 
identification of such patterns also justifies the parameters being used for the tumor 
stratification. To differentiate this approach from an mRNA-alone approach, the current 
approach of integrating clinical data with the gene expression data is referred to herein as a 
"comprehensive prognosis". 

5.2 DEFINITIONS 
[0066] As used herein, "BRCA1 tumor" or "BRCA1 type" means a tumor having cells 
containing a mutation of the BRCA1 locus. 

[0067] The "absolute amplitude" of correlation means the absolute value of the correlation; 
e.g., both correlation coefficients -0.35 and 0.35 have an absolute amplitude of 0.35. 
[0068] "Marker" means a cellular constituent, or a modification of a cellular constituent 
{e.g., an entire gene, EST derived from that gene, a protein encoded by that gene, post- 
translational modification of the protein, etc.) the expression or level of which changes 
between certain conditions. Where a change in a characteristic of the constituent correlates 
with a certain condition, the constituent is a marker for that condition. 
[0069] "Marker-derived polynucleotides" means the RNA transcribed from a marker gene, 
any cDNA or cRNA produced therefrom, and any nucleic acid derived therefrom, such as 
synthetic nucleic acid having a sequence derived from the gene corresponding to the marker 
gene. 

[0070] A "similarity value" is a number that represents the degree of similarity between two 
things being compared. For example, a similarity value may be a number that indicates the 
overall similarity between a patient's expression profile of specific phenotype-related 
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markers and a template specific to that phenotype (for instance, the similarity to a "good 
prognosis" template, where the phenotype is a good prognosis). The similarity value may be 
expressed as a similarity metric, such as a correlation coefficient, or may simply be expressed 
as the expression level difference, or the aggregate of the expression level differences, 
between a patient sample and a template. 

[007 1 ] A "patient subset" is a group of individuals, all of whom have a particular condition, 
or are subject to a particular condition, which is distinguished from other individuals having 
that condition by one or more phenotypic, genotypic or clinical characteristics of the 
condition, or of a response to the condition. For example, where the condition is breast 
cancer, individuals may belong to an "ER + " or an "ER~" patient subset, or may belong to a 
particular age group patient subset. 

[0072] A gene and/or marker is "informative" for a condition, phenotype, genotype or 
clinical characteristic if the expression of the gene or marker is correlated or anticorrelated 
with the condition, phenotype, genotype or clinical characteristic to a greater degree than 
would be expected by chance. 

[0073] An individual of a given age can be classified as "ER/AGE high" if the individual' s 
ER level is higher than a threshold value for the given age. The threshold can be age- 
dependent, i.e., a different threshold for each different age. In one embodiment, the age- 
dependent threshold value is calculated as c • (AGE - d), where c is a coefficient, AGE is the 
age of the patient, and d is an age threshold. The parameters c and d depend on the ER level 
and AGE used. They can be determined by fitting patients' ER level-age distribution to a 
bimodal distribution of two subgroups each having a different ER level-age dependence. In a 
specific embodiment, c = 0.1 and d = 42.5 is used for ER levels represented by a log(ratio) of 
ER expression level. Thus, for example, the threshold for a 45-year old individual in this 
embodiment is 0.1 (45-42.5), or 0.25, and if the log(ratio) of ER expression level of the 
individual is equal to or greater than 0.25, the individual is classified as "ER/AGE high"; 
otherwise, the individual is classified as "ER/AGE low." 

5.3 IDENTIFICATION OF DIAGNOSTIC AND PROGNOSTIC MARKER 
SETS 

5.3.1 IDENTIFICATION OF CONDITION SUBSETS 
[0074] The present invention provides methods of identifying sets of genes and/or markers 
useful in the diagnosis and prognosis of breast cancer. More generally, the invention also 
provides methods of identifying sets of genes and/or markers useful in the diagnosis or 
prognosis of other cancers, and even more generally, of identifying sets of genes and/or 
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markers useful in the differentiation between subgroups of individuals having a particular 
condition, such as a disease or exposure to a particular environmental condition. 
[0075] The method may be applied to any condition for which a plurality of phenotypic or 
genotypic subsets may be identified. The condition may be a disease; for example, the 
condition may be cancer, an autoimmune disease, an inflammatory disease, an infectious 
disease, a neurological disease, a degenerative disease, etc. The condition may be 
environmental; for example, the condition may be a particular diet, geographic location, etc.; 
the condition may be exposure to a compound, including, for example, a drug, a toxin, a 
carcinogen, a foodstuff, a poison, an inhaled compound, an ingested compound, etc.; the 
condition may be a particular genetic background or predisposition to a medical condition; 
etc. 

[0076] Where the condition is cancer, the condition may be any cancer, for example, 
without limitation: leukemias, including acute leukemia, acute lymphocytic leukemia, acute 
myelocytic leukemia, myeloblasts leukemia, promyelocyte leukemia, myelomonocytic 
leukemia, monocytic leukemia, and erythroleukemia; chronic leukemia, such as chronic 
myelocytic (granulocytic) leukemia or chronic lymphocytic leukemia; polycythemia vera; 
lymphomas, such as Hodgkin's disease and non-Hodgkin's disease; multiple myeloma; 
Waldenstrom's macroglobulinemia; heavy chain disease; solid tumors, such as sarcomas and 
carcinomas, fibrosarcoma, myxosarcoma, liposarcoma, chondrosarcoma, osteogenic sarcoma, 
chordoma, angiosarcoma, endotheliosarcoma, lymphangiosarcoma, 

lymphangioendotheliosarcoma, synovioma, mesothelioma, Ewing's tumor, leiomyosarcoma, 
rhabdomyosarcoma, colon carcinoma, pancreatic cancer, breast cancer, ovarian cancer, 
prostate cancer, squamous cell carcinoma, basal cell carcinoma, adenocarcinoma, sweat gland 
carcinoma, sebaceous gland carcinoma, papillary carcinoma, papillary adenocarcinomas, 
cystadenocarcinoma, medullary carcinoma, bronchogenic carcinoma, renal cell carcinoma, 
hepatoma, bile duct carcinoma, choriocarcinoma, seminoma, embryonal carcinoma, Wilms' 
tumor, cervical cancer, testicular tumor, lung carcinoma, small cell lung carcinoma, bladder 
carcinoma, epithelial carcinoma, glioma, astrocytoma, medulloblastoma, craniopharyngioma, 
ependymoma, pinealoma, hemangioblastoma, acoustic neuroma, oligodendroglioma, 
menangioma, melanoma, neuroblastoma, or retinoblastoma; etc. 

[0077] Rather than stratifying individuals, such as patients or tumor samples derived from 
patients, by gene expression patterns in the first instance, the method of identifying sets of 
genes informative for a condition begins by identifying phenotypic, genotypic or clinical 
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subsets of individuals within the larger class of individuals having or affected by the 
condition. 

[0078] In one embodiment, the condition is cancer, and the subsets are distinguished by 
phenotypic, genotypic, and/or clinical characteristics of the cancer. In this embodiment, 
groups of individuals are classified according to one or more phenotypic, genotypic, or 
clinical characteristics relevant to the cancer into patient subsets. At any step in the process 
of subdividing a patient population into patient subsets, the expression level of one or more 
genes may be determined in order to identify whether a prognosis-informative set of genes 
may be identified for the particular patient subset. If an informative gene set is identified, but 
is not as informative as desired, the patient subset may be further divided and a new geneset 
identified. These subsets may be further subdivided. For example, a group of individuals 
affected by a particular cancer may be classified first on the basis of a phenotypic, genotypic 
or clinical characteristic A into subsets SI and S2. The levels of expression of a plurality of 
genes are then determined in tumor samples taken from individuals that fall within subsets SI 
or S2 in order to identify sets of genes informative for prognosis within these subsets. 
Subsets SI and S2 may then each be subdivided into two or more subsets based on other 
phenotypic, genotypic or clinical characteristics. The basis for subdivision, if performed, 
need not be the same for SI and S2. For example, in various embodiments, SI is not 
subdivided, while S2 is subdivided on the basis of characteristic B; or SI is subdivided based 
on characteristic B while S2 is not subdivided; or SI and S2 are both subdivided on the basis 
of characteristic B; or SI is subdivided based on characteristic B, while S2 is subdivided 
according to characteristic C; and so on. For a particular decision matrix leading to a 
plurality of patient subsets, the preferred outcome is a prognosis-informative set of genes for 
each patient subset. Different decision matrices may lead to different patient subsets, which, 
in turn, may result in different sets of prognosis-informative genes. 
[0079] In the specific example of breast cancer, a plurality of phenotypic, genotypic or 
clinical indications are used to classify a patient as being a member of one of a plurality of 
patient subsets, wherein the indications are medically, biochemically or genetically relevant 
to breast cancer. For example, a group of patients may be classified into patient subsets 
based on criteria including, but not limited to, estrogen receptor (ER) status, type of tumor 
(i.e., BRCA1 -type or sporadic), lymph node status, grade of cancer, invasiveness of the tumor, 
or age. "BRCAl-type" indicates that the BRCA1 mutation is present. In each classification 
step, a group of cancer patients may be classified into only two classes, for example, ER+ or 
ER , or into three or more subsets (for example, by tumor grade), depending upon the 
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characteristic used to determine the subsets. As used herein, "ER+" indicates that the 
estrogen receptor is expressed at some elevated level; for example, it may indicate that the 
estrogen receptor is detectably expressed, or may indicate that more than 1 0% of cells are 
histologically stained for the receptor, etc. Conversely, "ER— " indicates that the estrogen 
receptor is expressed at a reduced level or not at all; for example, it may indicate that the 
receptor is not detectably expressed, or that 10% or less of cells are histologically stained for 
the receptor, etc. Marker gene sets optimized for each phenotypic class are preferably 
determined after the subsets are established. Where informative markers for a particular 
patient subset, distinguished from another subset by a particular characteristic of the 
condition of interest, cannot be determined, the subset may be further divided by another 
characteristic of the condition to create a plurality of second patient subsets, whereupon genes 
.informative for these second patient subsets may be identified. 

[0080] FIG. 1 depicts the process, described in the Examples, of subdivision of a collection 
of breast cancer patients according to phenotypic and genotypic characteristics relevant to 
breast cancer, in preparation for identification of genes informative for prognosis. A 
collection of breast cancer tumor samples was first subdivided by estrogen receptor status. 
ER status was chosen because the presence or absence of the estrogen receptor greatly 
influences the expression of other genes. In the ER+ patient subset, it was noted that patients 
appeared to be bimodally distributed by ER level vs. age; that is, ER level dependence upon 
age tended to fall within two classes, as separated by the solid line in FIG. 2A. This 
bimodality was used to further subdivide ER+ individuals into "ER+, ER/AGE high" 
individuals and "ER+, ER/AGE low" individuals. A set of informative genes was identified 
for the ER+, ER/AGE high patient subset. An informative set was not identified for the ER+ ? 
ER/AGE low subset, however, so the subset of patients was further divided into LN+ and 
LN- individuals. Thus, in one embodiment, the present invention provides a method of 
identifying a set of informative genes or markers for a condition comprising a plurality of 
phenotypic or genotypic characteristics, comprising (a) classifying each of a plurality of 
samples or individuals on the basis of one phenotypic or genotypic characteristic into a 
plurality of first classes; and (b) identifying within each of said first classes a set of 
informative genes or markers, wherein said set of informative genes or markers within each 
said first classes is unique to said class. 

5.3.2 IDENTIFICATION OF MARKER SETS INFORMATIVE FOR 
PATIENT SUBSETS 
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[0081] Once a patient subset is identified, markers, such as genes, informative for a 
particular outcome, such as prognosis, may be identified. In one embodiment, the method for 
identifying marker sets is as follows. This example describes the use of genes and gene- 
derived nucleic acids as markers; however, proteins or other cellular constituents may be used 
as markers of the condition. 

[0082] After extraction and labeling of target polynucleotides, the expression of a plurality 
of markers, such as genes, in a sample X is compared to the expression of the plurality 
markers in a standard or control. In one embodiment, the standard or control comprises 
target markers, such as polynucleotide molecules, derived from one or more samples from a 
plurality of normal individuals, or a plurality of individuals not exposed to a particular 
condition. For example, the control, or normal, individuals may be persons without the 
particular disease or condition of interest (e.g., individuals not afflicted with breast cancer, 
where breast cancer is the disease of interest), or may be an individual not exposed to a 
particular environmental condition. The standard or control may also comprise target 
polynucleotide molecules, derived from one or more samples derived from individuals having 
a different form or stage of the same disease; a different disease or different condition, or 
individuals exposed or subjected to a different condition, than the individual from which 
sample X was obtained. The control may be a sample, or set of samples, taken from the 
individual at an earlier time, for example, to assess the progression of a condition, or the 
response to a course of therapy. 

[0083] In a preferred embodiment, the standard or control is a pool of target polynucleotide 
molecules. However, where protein levels, or the levels of any other relevant biomolecule, 
are to be compared, the pool may be a pool of proteins or the relevant biomolecule. In a 
preferred embodiment in the context of breast cancer, the pool comprises samples taken from 
a number of individuals having sporadic-type tumors. 

[0084] In another preferred embodiment, the pool comprises an artificially-generated 
population of nucleic acids designed to approximate the level of nucleic acid derived from 
each marker found in a pool of marker-derived nucleic acids derived from tumor samples. In 
another embodiment, the pool, also called a "mathematical sample pool " is represented by a 
set of expression values, rather than a set of physical polynucleotides; the level of expression 
of relevant markers in a sample from an individual with a condition, such as a disease, is 
compared to values representing control levels of expression for the same markers in the 
mathematical sample pool. Such a control may be a set of values stored on a computer. Such 
artificial or mathematical controls may be constructed for any condition of interest. 
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[0085] In another embodiment specific to breast cancer, the pool is derived from normal or 
breast cancer cell lines or cell line samples. In a preferred embodiment, the pool comprises 
samples taken from individuals within a specific patient subset, e.g., "ER+, ER/AGE high" 
individuals, wherein each of said individuals has a good prognosis, or each of said individuals 
has a poor prognosis. Of course, where, for example, expressed proteins are used as markers, 
the proteins are obtained from the individual's sample, and the standard or control could be a 
pool of proteins from a number of normal individuals, or from a number of individuals 
having a particular state of a condition, such as a pool of samples from individuals having a 
particular prognosis of breast cancer. 

[0086] The comparison may be accomplished by any means known in the art. For example, 
expression levels of various markers may be assessed by separation of target polynucleotide 
molecules (e.g., RNA or cDNA) derived from the markers in agarose or polyacrylamide gels, 
followed by hybridization with marker-specific oligonucleotide probes. Alternatively, the 
comparison may be accomplished by the labeling of target polynucleotide molecules 
followed by separation on a sequencing gel. Polynucleotide samples are placed on the gel 
such that patient and control or standard polynucleotides are in adjacent lanes. Comparison 
of expression levels is accomplished visually or by means of densitometer. In a preferred 
embodiment, the expression of all markers is assessed simultaneously by hybridization to a 
microarray. In each approach, markers meeting certain criteria are identified as informative 
for the prognosis of breast cancer. 

[0087] Marker genes are selected based upon significant difference of expression in a 
condition, such as a disease, as compared to a standard or control condition. Marker genes 
may be screened, for example, by determining whether they show significant variation within 
a set of samples of interest. Genes that do not show a significant amount of variation within 
the set of samples are presumed not to be informative for the disease or condition, and are not 
selected as markers for the disease or condition. Genes showing significant variation within 
the sample set are candidate informative genes for the disease or condition. The degree of 
variation may be estimated by calculating the difference of the expression of the gene, or 
ratio of expression between sample and control, within the set of samples. The expression, or 
ratio of expressions, may be transformed by any means, e.g., linear or log transformation. 
Selection may be made based upon either significant up- or down regulation of the marker in 
the patient sample. Selection may also be made by calculation of the statistical significance 
(i.e., the p-value) of the correlation between the expression of the marker and the disease and 
condition. Preferably, both selection criteria are used. Thus, in one embodiment of the 
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present invention, markers associated with prognosis of breast cancer within a patient subset 
are selected where the markers show both more than two-fold change (increase or decrease) 
in expression as compared to a standard, and the p-value for the correlation between the 
existence of breast cancer and the change in marker expression is no more than 0.01 (i.e., is 
statistically significant). 

[0088] In the context of the present invention, "good prognosis" indicates a desired outcome 
for a particular condition, especially a particular disease, and "poor prognosis" indicates an 
undesired outcome of the condition. For example, where the condition is cancer, a "good 
prognosis" may mean partial or complete remission, and "poor prognosis" may mean 
reappearance of the cancer after treatment. What constitutes "good prognosis" and "poor 
prognosis" is specific to the condition of interest, for example, specific to the particular 
cancer an individual suffers. For example, "good prognosis" for pancreatic cancer may be 
survival for one or two years after initial diagnosis, while "good prognosis" for Hodgkin's 
disease may be survival for five years or more. In the specific example of breast cancer, 
"good prognosis" means the likelihood of non-reoccurrence of metastases within a period of 
1, 2, 3, 4, 5 or more years after initial diagnosis, and "poor prognosis" means the likelihood 
of reoccurrence of metastasis within that period. In a more specific example, "good 
prognosis" means the likelihood of non-reoccurrence of metastases within 5 years after initial 
diagnosis, and "poor prognosis" means the likelihood of reoccurrence of metastasis within 
that period. 

[0089] In a more specific embodiment for cancer, for example, breast cancer, using a 
number of breast cancer tumor samples, markers are identified by calculation of correlation 
coefficients p between the clinical category or clinical parameter(s) c and the linear, 
logarithmic or any transform of the expression ratio r across all samples for each individual 
gene. Specifically, the correlation coefficient may be calculated as: 
[0090] ^ = (c.F)/(||c||.||F||) Equation (1) 

[0091] Markers for which the coefficient of correlation exceeds a cutoff are identified as 
prognosis-informative markers specific for a particular clinical type, e.g., good prognosis, 
within a given patient subset. Such a cutoff or threshold may correspond to a certain 
significance of discriminating genes obtained by Monte Carlo simulations. The threshold 
depends upon the number of samples used; the threshold can be calculated as 3 X \/Jn^3 , 
where 1 / V«-3 is the distribution width and n = the number of samples. In a specific 
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embodiment, markers are chosen if the correlation coefficient is greater than about 0.3 or less 
than about -0.3. 

[0092] Next, the significance of the correlation is calculated. This significance may be 
calculated by any statistical means by which such significance is calculated. In a specific 
example, a set of correlation data is generated using a Monte-Carlo technique to randomize 
the association between the expression difference of a particular marker and the clinical 
category. The frequency distribution of markers satisfying the criteria in the Monte-Carlo 
runs is used to determine whether the number of markers selected by correlation with clinical 
data is significant. 

[0093] Once a marker set is identified, the markers may be rank-ordered in order of 
significance of discrimination. One means of rank ordering is by the amplitude of correlation 
between the change in gene expression of the marker and the specific condition being 
discriminated. Another, preferred, means is to use a statistical metric. In a specific 
embodiment, the metric is a t-test-like statistic: 

[0094] f = (W-(*2fl/ Equation (2) 

/Vh («i -l) + ^2<>2 +n 2 -1)/(1/^ +0i 2 ) 

[0095] In this equation, (x x ) is the error- weighted average of the log ratio of transcript 
expression measurements within a first clinical group {e.g., good prognosis), (x 2 ) is the error- 
weighted average of log ratio within a second, related clinical group {e.g., poor prognosis), 
<r x is the variance of the log ratio within the first clinical group {e.g., good prognosis), n x is 
the number of samples for which valid measurements of log ratios are available, <x 2 is the 
variance of log ratio within the second clinical group {e.g., poor prognosis), and n 2 is the 
number of samples for which valid measurements of log ratios are available. The lvalue 
represents the variance-compensated difference between two means. 

[0096] The rank-ordered marker set may be used to optimize the number of markers in the 
set used for discrimination. This is accomplished generally in a "leave one out" method as 
follows. In a first run, a subset, for example five, of the markers from the top of the ranked 
list is used to generate a template, where out of X samples, X-l are used to generate the 
template, and the status of the remaining sample is predicted. This process is repeated for 
every sample until every one of the X samples is predicted once. In a second run, additional 
markers, for example five additional markers, are added, so that a template is now generated 
from 10 markers, and the outcome of the remaining sample is predicted. This process is 
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repeated until the entire set of markers is used to generate the template. For each of the runs, 
type 1 error (false negative) and type 2 errors (false positive) are counted; the optimal number 
of markers is that number where the type 1 error rate, or type 2 error rate, or preferably the 
total of type 1 and type 2 error rate is lowest. 

[0097] For prognostic markers, validation of the marker set may be accomplished by an 
additional statistic, a survival model. This statistic generates the probability of tumor distant 
metastases as a function of time since initial diagnosis. A number of models may be used, 
including Weibull, normal, log-normal, log logistic, log-exponential, or log-Rayleigh 
(Chapter 12 "Life Testing", S-PLUS 2000 Guide to Statistics, Vol. 2, p. 368 (2000)). For 
the "normal" model, the probability of distant metastases P at time t is calculated as 
[0098] . P = axcxp{-t 2 /T 2 ) Equation 
(3) 

[0099] where a is fixed and equal to 1 , and T is a parameter to be fitted and measures the 
"expected lifetime". 

[00100] It is preferable that the above marker identification process be iterated one or more 
times by excluding one or more samples from the marker selection or ranking {i.e., from the 
calculation of correlation). Those samples being excluded are the ones that can not be 
predicted correctly from the previous iteration. Preferably, those samples excluded from 
marker selection in this iteration process are included in the classifier performance 
evaluation, to avoid overstating the performance. 

[00101] It will be apparent to those skilled in the art that the above methods, in particular the 
statistical methods described above, are not limited to the identification of markers associated 
with the prognosis of breast cancer within a particular patient subset, but may be used to 
identify set of marker genes associated with any phenotype or condition, or with any subset 
of a phenotype or condition defined by one or more characteristics of the phenotype or 
condition. The phenotype or condition can be the presence or absence of a disease such as 
cancer, or the presence or absence of any identifying clinical condition associated with that 
cancer. In the disease context, the phenotype may be a prognosis such as a survival time, 
probability of distant metastases of a disease condition, or likelihood of a particular response 
to a therapeutic or prophylactic regimen. The phenotype need not be cancer, or a disease; the 
phenotype may be a nominal characteristic associated with a healthy individual. 
[00102] Thus, the invention provides a method of identifying a set of informative genes or 
markers for a condition comprising a plurality of phenotypic of genotypic characteristics, 
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comprising: (a) classifying each of a plurality of samples or individuals on the basis of one or 
more phenotypic or genotypic characteristics of said condition into a plurality of first classes; 
(b) identifying within each of said first classes a first set of genes or markers informative for 
said condition, wherein said first set of genes or markers within each of said first classes is 
unique to said class relative to other classes. In a specific embodiment, samples or 
individuals in at least one of said first classes are additionally classified on the basis of a 
phenotypic or genotypic characteristic different from that used to distinguish said first classes 
into a plurality of second classes, and identifying within at least one of said second classes a 
second set of informative genes or markers, wherein said second set of informative genes or 
markers within each of said second classes is unique to said second class relative to other 
classes. In another embodiment, the invention provides a method of identifying a set of 
informative genes or markers for a condition comprising a plurality of phenotypic or 
genotypic characteristics, comprising: (a) classifying each of a plurality of samples or 
individuals on the basis of one or more phenotypic or genotypic characteristics into a 
plurality of first classes; (b) classifying at least one of said first classes into a plurality of 
second classes on the basis of phenotypic or genotypic characteristic different than that used 
to distinguish said plurality of first classes; (c) identifying within at least one of said first 
classes or said second classes a set of genes or markers informative for said condition, 
wherein said set of genes or markers is unique to said class relative to other classes. The 
invention further provides a method of identifying a set of informative genes or markers for a 
condition comprising a plurality of phenotypic or genotypic characteristics, comprising: (a) 
selecting a first characteristic from said plurality of phenotypic or genotypic characteristics; 
(b) identifying at least two first condition classes differentiable by said first characteristic; (c) 
selecting a plurality of individuals classifiable into at least one of said first condition classes; 
and (d) identifying in samples derived from each of said plurality of individuals a set of genes 
or markers informative for said condition within said at least one of said first condition 
classes. 

5.3.3 CLASSIFIER GENESETS FOR FIVE PATIENT SUBSETS 
[00103] The present invention provides sets of markers useful for the prognosis of breast 
cancer. The markers were identified according to the above methods in specific subsets of 
individuals with breast cancer. Generally, the marker sets were identified within a population 
of breast cancer patients that had been first stratified into five phenotypic categories based on 
criteria relevant to breast cancer prognosis, including estrogen receptor (ER) status, lymph 
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node status, type of mutation(s) (i.e., BRCAl-type or sporadic) and age at diagnosis. More 
specifically, patients, and tumors from which samples were taken, were classified as ER - , 
sporadic (i.e., being both estrogen receptor negative and having a non-BRCAl-type tumor); 
ER , BRCA1 (i.e., being both estrogen receptor negative and having a BRCAl-type tumor); 
ER+, ER/AGE high (i.e., estrogen receptor positive with a high ratio of the log (ratio) of 
estrogen receptor gene expression to age); ER+, ER/AGE low, LN+ (i.e., estrogen receptor 
positive with a low ratio of the log (ratio) of estrogen receptor gene expression to age, lymph 
node positive); and ER + , ER/AGE low, LN~ (i.e., estrogen receptor positive with a low ratio 
of the log (ratio) of estrogen receptor gene expression to age, lymph node negative). The 
rationale for subdivision of the original patient set into these five subsets is detailed in the 
Examples (Section 6). The marker sets useful for each of the subsets above are provided in 
Tables 1-5, respectively. 



Table 1 : Geneset of 20 markers used to classify ER , sporadic individuals. 



Accession/ 
1 uontig No. 


Gene 


Avg 
good 
xdev 


Avg 
poor 
xdev 


Corre- 
lation 


Description 


Sp_xrefJkeyword J ist 


SEQ ID 


AF055033 


IGFBP5 




u.oo 


n Rzi 


insulin-like 
growth factor 
binding protein 5 


Growth factor binding, 
Glycoprotein, Signal, 3D- 
structure 


11 


NM_000599 


IGFBP5 


-3.41 


0.43 


0.53 


insulin-like 
growth factor 
binding protein 5 


Growth factor binding, 
Glycoprotein, Signal, 3D- 
structure 


51 


L27560 


IGFBP5 


-4.55 


0 


0.52 


EST 


Hypothetical protein 


29 


AF052162 


FLJ 12443 


-0.27 


1.6 


0.52 


EST 


Hypothetical protein 


9 


NM_001456 


FLNA 


-0.61 


2.47 


0.52 


filamin A, alpha 
(actin binding 
protein 280) 


Hypothetical protein, 
Actin-binding, 
Phosphorylation, Repeat, 
Polymorphism, Disease 
mutation 


73 


NM_002205 


ITGA5 


-0.37 


2.08 


0.49 


integrin, alpha 5 
(fibronectin 
receptor, alpha 
polypeptide) 


Integrin, Cell adhesion, 
Receptor, Glycoprotein, 
Transmembrane, Signal, 
Calcium, Repeat 


93 


NM_013261 


PPARGC1 


0.09 


1.54 


0.47 


peroxisome proliferative activated receptor, 
gamma, coactivator 1 


231 


NM_001605 


AARS 


0.39 


2.36 


0.51 


alanyl-tRNA 
synthetase 


Aminoacyl-tRNA 
synthetase, Protein 
biosynthesis, Ligase, 
ATP-binding 


77 


X87949 


HSPA5 


-0.03 


2.03 


0.49 


heat shock 
70kDa protein 5 
(glucose- 
regulated 
protein, 78kDa) 


ATP-binding, Hypothetical 
protein, Endoplasmic 
reticulum, Signal 


273 


Contig50950_RC 


NGEF 


-1.17 


3.2 


0.52 


neuronal guanine nucleotide exchange 
factor 


337 
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Accession/ 
Contig No. 


Gene 


Avg 
good 
xdev 


Avg 
poor 
xdev 


Corre- 
lation 


Description 


Sp_xref_keywordJist 


SEQ ID 


NM_005689 


ABCB6 


-0.51 


2.26 


0.48 


ATP-binding 
cassette, sub- 
family B 
(MDR/TAP), 
member 6 


ATP-binding, Transport, 
Transmembrane, 
Mitochondrion, Inner 
membrane, Transit 
peptide, Hypothetical 
protein 


187 


NM_004577 


PSPH 


-0.56 


3.05 


0.51 


phosphoserine 
phosphatase 


Hydrolase, Serine 
biosynthesis, Magnesium, 
Phosphorylation 


151 


NM_003832 


PSPHL 


-2.08 


2.18 


0.5 


phosphoserine 

phosphatase- 

like 




131 


NM_002422 


MMP3 


-0.96 


2.54 


0.5 


matrix 

metalloDroteinas 
e 3 (stromelysin 
1 , progelatinase) 


Hydrolase, 
Metal Ion ro tease 
Glycoprotein, Zinc, 
Zymogen, Calcium, 
Collagen degradation, 
Extracellular matrix, 
Signal, Polymorphism, 
3D-structure 


101 


Contig37562_RC 




-3.42 


-6.02 


-0.59 


ESTs 




293 


NM_018465 


MDS030 


-0.82 


-3.28 


-0.58 


uncharacterized 
hematopoietic 
stem/progenitor 
cells protein 
MDS030 


Hypothetical protein 


267 


Contig54661_RC 




-0.79 


-2.08 


-0.54 


ESTs 




349 


AB032969 


KIAA1 143 


-0.6 


-2.85 


-0.53 


KIAA1143 
protein 


Hypothetical protein 


1 


Contig55353_RC 


KIAA1915 


-0.27 


-1.82 


-0.47 


KIAA1915 
protein 


Hypothetical protein 


353 


NM_005213 


CSTA 


2.11 


-3.4 


-0.49 


cystatin A (stefin 
A) 


Thiol protease inhibitor, 
3D-structure 


175 



Table 2. Geneset of 10 markers used to classify ER , BRCA1 individuals. 



Accession/ 
Contig No. 


Gene 


Avg 
good 
xdev 


Avg 
poor 
xdev 


Correl- 
ation 


Sequence 
name 


Description 


Sp_jeref_keywo 
rd_list 


SEQ ID 


AF005487 




6.08 


0.5 


-0.79 


HLA-DRB6 


Homo sapiens 
MHC class II 
antigen (DRB6) 
mRNA, HLA- 
DRB6*0201 allele, 
sequence. 


MHC 


3 


Contig50728_RC 
Contig53598_RC 




4.02 


0.25 


-0.77 




ESTs, Weakly similar to S26650 
DNA-binding protein 5 - human ! 
[H. sapiens] 


333 


8.41 


3.26 


-0.77 


FLJ11413 


hypothetical protein 
FLJ11413 


Hypothetical 
protein 


343 


NMJ302888 


RARR 
ES1 


6.9 


0.05 


-0.87 


RARRES1 


retinoic acid 
receptor responder 
(tazarotene 
induced) 1 


Receptor, 
Transmembrane 
, Signal-anchor 


109 
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NM_005218 


DEFB1 


5.14 


-3.02 


-0.81 


DEFB1 


defensin, beta 1 


Antibiotic, 
Signal, 3D- 
structure 


177 


U17077 


BENE 


2.72 


-1.72 


-0.77 


BENE 


BENE protein 


Transmembrane 


271 


curing I4DOO r\L/ 




1.29 


-2.31 


-0.74 




ESTs 




279 


Contig53641_RC 




-3.29 


4.23 


0.75 


MAGE-E1 


MAGE-E1 protein 


Hypothetical 
protein 


345 


Contig56678_RC 




-6.7 


-9.73 


-0.82 




ESTs, Highly similai 
THYAJHUMAN Pro 
[H.sapiens] 


to 

thymosin alpha 


357 


NM_005461 


KRML 


0.88 


-3.38 


-0.75 


MAFB 


v-maf 

musculoaponeuroti 
c fibrosarcoma 
oncogene homolog 
B (avian) 


Transcription 

regulation, 

Repressor, 

DNA-binding, 

Nuclear protein, 

Hypothetical ; 

protein 


181 



Table 3. Geneset of 50 markers used to classify ER+, ER/AGE high individuals. 



Contig No. 


Gene 


\ Avg 
good 
xdev 


Avg 
poor 
xdev 


Corre- 
lation 


Description 


Sp_xref keyword 
Jist 


SEQ ID 


NMJD03600 


STK15 


-2.93 


2.08 


0.8 


serine/threonine 
kinase 6 


ATP-binding, 
Kinase, 

Serine/threonine- 
protein kinase, 
Transferase 


125 


NM_003158 


STK6 


-1.57 


1.42 


0.78 


serine/threonine 
kinase 6 


ATP-binding, 
Kinase, 

Serine/threonine- 
protein kinase, 
Transferase 


113 


NML007019 


UBCH10 


-2.98 


2.62 


0.81 


ubiquitin-conjugating 
enzyme E2C 


Hypothetical 
protein, Ubl 
conjugation 
Dathway, Ligase, 
Wultigene family, 
Mitosis, Cell cycle, 
Cell division 


217 


NM_013277 


ID-GAP 


-2.43 


2.43 


0.77 


Rac GTPase 
activating protein 1 


Hypothetical 
protein 


233 


NMJ304336 


BUB1 


-2.04 


. 1.39 


0.77 


BUB1 budding 
uninhibited by 
Denzimidazoles 1 
homolog (yeast) 


Transferase, 
Serine/threonine- 
protein kinase, 
ATP-binding, Cell 
cycle, Nuclear 
Drotein, Mitosis, 
Phosphorylation, 
Polymorphism 


147 
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Accession/ 
Contig No. 


Gene 


Avg 

good 

xdev 


Avg 
poor 
xdev 


Corre- 
lation 


Description 


Sp_xref_keyword 
Jist | 


SEQ ID 


NM_006607 


PTTG2 


-1.71 


1.49 


0.72 


pituitary tumor- 
transforming 2 




211 


AK001166 


FLJ 11252 


-1.33 


0.99 


0.71 


hypothetical protein 
FLJ11252 


Hypothetical 
protein 


13 


NM_004701 


CCNB2 


-4.62 


2.01 


0.81 


cyclin B2 


Cyclin, Cell cycle, 
Cell division, 
Mitosis 


153 


Contig57584_RC 




-3.68 


2.04 


0.78 


likely ortholog of mouse gene rich 
cluster, C8 gene 


359 


NM_006845 


KNSL6 


-4.13 


1.05 


0.73 


kinesin-like 6 (mitotic 
centromere- 
associated kinesin) 


Hypothetical 
protein, Motor 
protein, 

Microtubules, ATP- 
Dinaing, ooiiea con, 
Nuclear protein 


215 


Contig38901_RC 




-3.08 


1.15 


0.75 


hypothetical protein 


Hypothetical 
proiem 


299 


NM_018410 


DKFZp76 
2E1312 


-4.38 


1.49 


0.75 


hypothetical protein 
DKFZp762E1312 


Hypothetical 
protein 


263 


NM_003981 


PRC1 


-3.52 


2.17 


0.78 


protein regulator of 
cytokinesis 1 




133 


NM_001809 


CENPA 


-5.04 


0.98 


0.75 


centromere protein 
A, 17kDa 


Hypothetical 
protein, 
Chromosomal 
protein, Nuclear 
protein, DNA- 
binding, 
Centromere, 
Antigen 


81 


NM_003504 


CDC45L 


-2.67 


1.22 


0.73 


CDC45 cell division 
cycle 45-like (S. 
cerevisiae) 


DNA replication, 
ueii cycie, iMuciear 
protein, Cell 
division 


123 


Contig41413_RC 




-5.43 


2.15 


0.74 


ribonucleotide 
reductase M2 
polypeptide 


Oxidoreductase, 
DNA replication, 
Iron 


305 


NM_004217 


STK12 


-2.17 


0.73 


0.72 


serine/threonine 
kinase 12 


Hypothetical 
protein, ATP- 
binding, Kinase, 
Serine/threonine- 
protein kinase, 
Transferase 


143 


NM_002358 


MAD2L1 


-2.65 


2.27 


0.83 


MAD2 mitotic arrest 
deficient-like 1 
(yeast) 


Cell cycle, Mitosis, 
Nuclear protein, 
3D-structure 


99 


NM_0 14321 


ORC6L 


-2.73 


1.8 


0.75 


origin recognition 
complex, subunit 6 
homolog-like (yeast) 


Hypothetical 
protein, DNA 
replication, Nuclear 
protein, DNA- 
binding 


241 


NM_012291 


KIAA0165 


-1.52 


1.55 


0.71 


extra spindle poles 
like 1 (S. cerevisiae) 


Hypothetical 
protein 


229 
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Accession/ 
Contig No. 


Gene 


Avg 
good 
xdev 


Avg 
poor 
xdev 


Corre- 
lation 


Description 


Sp xref keyword 
Jist 


SEQ ID 


NM_004203 


PKMYT1 


-3.64 


2.2 


0.7 


' retinoblastoma-like 2 
(p130) 


ATP-binding, 
Kinase, 

Serine/threonine- 
protein kinase, 
Transferase, 
Transcription 
regulation, DNA- 
binding, Nuclear 
protein, Cell cycle, 
Phosphorylation, 
Anti-oncogene 


137 


M96577 


E2F1 


-2.14 


1.42 


0.75 


E2F transcription 
factor 1 


Transcription 
regulation, 
Activator, DNA- 
binding, Nuclear 
protein, 

Phosphorylation, 
Cell cycle, 
Apoptosis, 
Polymorphism 


33 


NM_002266 


KPNA2 


-3.77 


1.78 


0.71 


karyopherin alpha 2 
(RAG cohort 1, 
importin alpha 1) 


Transport, Protein 
transport, Repeat, 
Nuclear protein, 
Polymorphism 


95 


Contig31288_RC 




-2.63 


0.7 


0.68 


ESTs, Weakly similar to hypothetical 
protein FLJ20489 [Homo sapiens] 
H sapiens] 


289 


NM_014501 


E2-EPF 


-1.55 


1.93 


0.7 


ubiquitin carrier 
protein 


Ubl conjugation 
pathway, Ligase, 
Multigene family 


247 


NM_001168 


BIRC5 


-5.76 


2.01 


0.78 


baculoviral IAP 
repeat-containing 5 
(survivin) 


Apoptosis, Thiol 
protease inhibitor, 
Alternative splicing, 
3D-structure, 
Hypothetical 
protein, Protease, 
Receptor 


63 


NM_003258 


TK1 


-4.57 


1.38 


0.71 


thymidine kinase 1, 
soluble 


Transferase, 
Kinase, DNA 
synthesis, ATP- 
binding 


115 


NM_001254 


CDC6 


-2.46 


0.28 


0.72 


CDC6 cell division 
cycle 6 homolog (S. 
cere vis iae) 


ATP-binding, Cell 
division 


67 


NM_004900 


DJ742C19 

.2 


-2.96 


0.13 


0.69 


apolipoprotein B 
mRNA editing 
enzyme, catalytic 
polypeptide-like 3B 


Hydrolase 


161 


NM_004702 


CCNE2 


-3.12 


2.13 


0.81 


cyclin E2 

( 


Cell cycle, Cell 
division, Cyclin, 
Hypothetical 
Drotein, 

Phosphorylation, 
Alternative splicing, 
Muclear protein 


155 
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J Accession/ 
Contig No. 


Gene 


Avg 
good 
xdev 


Avg 
poor 
xdev 


Corre- 
lation 


Description 


Sp xref keyword 
Jist 


SEQ ID 


AL160131 




-3.07 


2.42 


0.7 


hypothetical protein 
MGC861 


Hypothetical 
protein 


21 


NM_0 16359 


LOC5120 
3 


-3.22 


2.61 


0.76 


nucleolar protein 
ANKT 


Hypothetical 
protein, Nuclear 
protein 


253 


NM_004856 


KNSL5 


-1.52 


1.1 


0.71 


kinesin-like 5 (mitotic 
kinesin-like protein 
1) 


Motor protein, Cell 
division, 

Microtubules, ATP- 
binding, Coiled coil, 
Mitosis, Cell cycle, 
Nuclear protein 


159 


NM_000057 


BLM 


-1.54 


0.76 


0.71 


Bloom syndrome 


Hvdrolase 
Helicase, ATP- 
binding, DNA- 
bindino Nuclear 
protein, DNA 
replication, 
Disease mutation 




NMJD18455 


BM039 


-2.44 


1.18 


0.7 


uncharacterized 
bone marrow protein 
BM039 




265 


NM_002106 


H2AFZ 


-2.49 


1.53 


0.72 


H2A histone family, 
member Z 


Chromosomal 
protein, 

Nucleosome core, 
Nuclear protein, 
DNA-binding, 
Multigene family 


91 


Contig64688 




-2.68 


3.1 


0.73 


hypothetical protein 
FLJ23468 


Hypothetical 
protein 


365 


Contig44289_RC 




-1.65 


1.6 


0.67 


ESTs 




315 


Contig28552_RC 




-1.37 


1.53 


0.68 


diaphanous homolog 
3 (Drosophila) 


Hypothetical 
protein, Coiled coil, 
Repeat, Alternative 
splicing 


281 


Contig46218_RC 




-1.31 


1.56 


0.68 


ESTs, Weakly similar to T1 9201 
hypothetical protein C1 1 G6.3 - i 
Caenorhabditis eleqans FC. eleqansl 


321 


Contig28947_RC 




-1.3 


0.98 


0.67 


cell division cycle 
25A 


Hypothetical 
protein, Cell 
division, Mitosis, 
Hydrolase, 
Alternative splicing, 
Multigene family, 
3D-structure 


<— vJ «w> 


NM_016095 


LOC5165 
9 


-1.4 


2.13 


0.67 


HSPC037 protein 


Hypothetical 
protein 


249 


NM_003090 


SNRPA1 


-3.26 


0.95 


0.7 


small nuclear 
ribonucleoprotein 
polypeptide A 


Hypothetical 
protein, Nuclear 
srotein, RNA- 
Dinding, 

Ribonucleoprotein, 
Leucine-rich 
repeat, Repeat, 
3D-structure 


111 
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Accession/ 
Contig No. 


Gene 


Avg 
good 
xdev 


Avg 
poor 
xdev 


Corre- 
lation 


Description 


Sp xref keyword 
Jist 


SEQ ID 


NMJD02811 


PSMD7 


-2.48 


1.89 


0.7 


proteasome 
(prosome, 
macropain) 26S 
subunit, non- 
ATPase, 7 (Mov34 
homolog) 


Proteasome 


107 


Contig38288JRC 




-2.34 


0.97 


0.67 


hypothetical protein 
DKFZp762A2013 


Hypothetical 
d rote in 


297 


NM_003406 


YWHAZ 


-1.5 


2.79 


0.68 


tyrosine 3- 
monooxygenase/tryp 
tophan 5- 
monooxygenase 
activation protein, 
zeta polypeptide 


Brain, Neurone, 
Phosphorylation, 
Acetylation, 
Multiaene familv 
3D-structure 


121 


AL1 37540 


NTN4 


2.13 


-4.61 


-0.69 


netrin 4 


Hypothetical 
protein, Laminin 
EGF-like domain 

V- ■» ■ III \ W VI \J III S-i III, 

Signal 


19 


AL049367 




1.9 


-3.2 


-0.68 


EST 


Transducer, 
Prenvlation 
Lipoprotein, 
Multigene family, 
Acetylation 


15 


NMJ3 13409 


FST 


1.04 


-5.78 


-0.69 


follistatin 


Glvcoorotein 
Repeat, Signal, 
Alternative splicing 


235 


NM_000060 


BTD 


3.1 


-1.45 


-0.67 


biotinidase 


Hydrolase, 
Glycoprotein, 
Signal, Disease 
mutation 


37 


Table 4. Geneset of 50 markers used to classify ER+ ? ER/AGE low, LN+ individuals. 


Accession/ 
Contig No. 


Gene . 


Avg 
good 
xdev 


Avg 

poor 

xdev 


Correl- 
ation 


Description 


Sp xref keyword 
Jist 


SEQ ID 



36 



WO 2005/086891 



PCT/US2005/007894 



Accession/ 
Contig No. 


Gene 


Avg 
good 
xdev 


Avg 
poor 
xdev 


Correl- 
ation 


Description 


Sp xref keyword 
Jist 


SEQ ID 


NM_006417 


MTAP44 


-1.5 


3 


0.6S 


Fc fragment of IgG, 
low affinity Mb, 
receptor for (CD32) 


Hydrolase, 

Hypothetical 

protein, 

Immunoglobulin 
domain, IgG- 
binding protein, 
Receptor, 
Transmembrane, 
Glycoprotein, 
Signal, Repeat, 
Multigene family, 
Polymorphism, 
NAD, One-carbon 
metabolism, Serine 
protease, 
Zymogen, 
Protease 

Alternative splicing, 

Chromosomal 

translocation, 

Proto-oncogene, 

Galaptin, Lectin, 

Antigen 


205 


NM_006820 


GS3686 


-4.3 


4.06 


0.69 


chromosome 1 open 
reading frame 29 


Hypothetical 
protein 


213 


NM_001548 


IFIT1 


-3.4 


4.27 


0.71 


Interferon-induced 
protein with 
tetratricopeptide 
repeats 1 


Repeat, TPR 
repeat, Interferon 
induction 


75 


Contig41538_RC 




-2.5 


3.16 


0.68 


ESTs, Moderately similar to hypothetical 
protein FLJ20489 [Homo sapiens] 


307 


NM_016816 


OAS1 


-1.7 


3.29 


0.75 


2\5'-oligoadenylate 
synthetase 1 , 
40/46kDa 


RNA-binding, 
Transferase, 
Nucleotidyltransfer 
ase, Interferon 
induction, 

Alternative splicing 


255 


Contig51660_RC 




-2.1 


2.65 


0.66 


28kD interferon 
responsive protein 


Transmembrane 


339 


Contig43645_RC 




-4.8 


1.44 


0.63 


Homo sapiens, 
clone 

IMAGE:4428577, 
mRNA, partial cds 


Hypothetical 
protein 


313 


AF026941 




-4.6 


2.71 


0.63 


EST, Weakly similar 
to 2004399A 
chromosomal 
protein [Homo 
sapiens] 


Hypothetical 
protein 


5 


NM_007315 


STAT1 


-3.5 


1.8 


0.59 


signal transducer 
and activator of 
transcription 1 , 
91kDa 


Transcription 
regulation, DNA- 
binding, Nuclear 
Drotein, 

Phosphorylation, 
SH2 domain, 
Mternative splicing, 
3D-structure 


225 
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Accession/ 
Contig No. 


Gene 


Avg 
good 
xdev 


Avg 
poor 
xdev 


Correl- 
ation 


Description 


Sp xref keyword 
Jist 


SEQ ID 


NM_002038 


G1P3 


-4.1 


5.64 


0.79 


interferon, alpha- 
inducible protein 
(clonelFI-6-16) 


Interferon 
induction, 
Transmembrane, 
Signal, Alternative 
splicing 


85 


NMJ305101 


ISG15 


-5.6 


5.34 


0.77 


interferon-stimulated 
protein, 15 kDa 


Interferon 
induction, Repeat 


169 


NM_002462 


MX1 


-6.1 


0.83 


0.56 


myxovirus (influenza 
virus) resistance 1 , 
interferon-inducible 
protein p78 (mouse) 


Hypothetical 
protein, Interferon 
induction, GTP- 
binding, Multigene 
family, Antiviral 


103 


NM_005532 


IFI27 


-5.8 


2.81 


0.59 


interferon, alpha- 
inducible protein 27 


Interferon 
induction, 
Transmembrane 


183 


NIVL002346 


LY6E 


-2.1 


3.58 


0.75 


lymphocyte antigen 
6 complex, locus E 


Signal, Antigen, 
Multigene family, 
Membrane, GPI- 
anchor 


97 


NM_016817 


OAS2 


-3.6 


1.89 


0.59 


2'-5'-oligoadenylate 
synthetase 2, 
69/71 kDa 


RNA-binding, 

Transferase, 

Nucleotidyltransfer 

ase, Repeat, 

Interferon 

induction, 

Alternative splicing, 
Myristate 


257 


Contig44909_RC 




-2.3 


1.13 


0.55 


hypothetical protein 
BC012330 


Hypothetical 
protein 


317 


NM_017414 


USP18 


-4.1 


3.37 


0.72 


ubiquitin specific 
protease 18 


Ubl conjugation 
pathway, 
Hydrolase, Thiol 
protease, 
Multigene family 


259 


NMJ304029 


IRF7 


-2.4 


3.67 


0.66 


interferon regulatory 
factor 7 


Collagen, 
Transcription 
regulation, DNA- 
binding, Nuclear 
protein, Activator, 
Alternative splicing 


135 


NM_004335 


BST2 


-3.2 


3.22 


0.57 


bone marrow 
stromal cell antigen 
2 


Transmembrane, 
Glycoprotein, 
Signal-anchor, 
Polymorphism 


145 


NM_002759 


PRKR 


-2.4 


1.8 


0.58 


protein kinase, 
interferon-inducible 
double stranded 
RNA dependent 


Transferase, 
Serine/threonine- 
protein kinase, 
ATP-binding, 
Repeat, 

Phosphorylation, 
Interferon 
induction, RNA- 
binding, 3D- 
structure 


105 
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Accession/ 
Contig No. 


Gene 


Avg 

good 

xdev 


Avg 
poor 
xdev 


Correl- 
ation 


Description 


Sp xref keyword 
Jist 


SEQ ID 


NM_006332 


IFI30 


-3.E 


S 2.6£ 


I 0.64 


interferon, gamma- 
inducible protein 30 


Oxidoreductase 

Interferon 

induction, 

Glycoprotein, 

Lysosome, Signal, 

Hypothetical 

protein 


203 


NM_009587 


LGALS9 


-3.2 


2.08 


0.6 


lectin, galactoside- 
binding, soluble, 9 
(galectin 9) 


Galaptin, Lectin, 
Repeat, Alternative 
splicing 


227 


NM_003641 


IFITM1 


-2.4 


5.54 


0.63 


interferon induced 
transmembrane 
protein 1 (9-27) 


Interferon 
induction, 
Transmembrane 


127 


NMJD17523 


HSXIAPA 
F1 


-1 


2.84 


0.7 


XIAP associated 
factor-1 


Hypothetical 
protein 


261 


NM_014314 


RIG-I 


-1.3 


3.55 


0.62 


RNA helicase 


ATP-binding, 

Helicase, 

Hydrolase, 

Hypothetical 

protein 


239 


Contig47563_RC 




-2.2 


3.11 


0.56 


ESTs 




325 | 


AI497657_RC 




-4.4 


5.61 


0.74 


guanine nucleotide 
binding protein 4 


Transducer, 
Prenylation, 
Lipoprotein, 
Multigene family 


335 


NM_000735 


CGA 


-4.3 


2.5 


0.58 


glycoprotein 
hormones, alpha 
polypeptide 


Hormone, 
Glycoprotein, 
Signal, 3D- 
structure 


53 


NMJ304988 


MAGEA1 


-1.4 


6.31 


0.64 


melanoma antigen, 
family A, 1 (directs 
expression of 
antigen MZ2-E) 


Antigen, Multigene 
family, 

Polymorphism, 
Tumor antigen 


163 


Contig54242_RC 




-1.2 


4.1 


0.65 


chromosome 17 
open reading frame 
26 


Hypothetical 
protein 


347 


NM_004710 


SYNGR2 


-1.4 


3.01 


0.54 


synaptogyrin 2 


Transmembrane 


157 


NMJ)01168 


BIRC5 


-3.7 


3.39 


0.64 


baculoviral IAP 
repeat-containing 5 
(survivin) 


Hypothetical 
Drotein, Protease, 
Receptor, 
Apoptosis, Thiol 
protease inhibitor, 
Alternative splicing, 
3D-structure 


63 


Contig41413_RC 




-4.4 


2.61 


0.57 i 
i 

f 


ibonucleotide < 
"eductase M2 I 
Dolypeptide 


Dxidoreductase, 
DNA replication, 
ron 


305 
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Accession/ 
Contig No. 


Gene 


Avg 
good 
xdev 


Avg 
poor 
xdev 


Correl- 
ation 


Description 


Sp xref keyword 
Jist 


SEQ ID 


NM_004203 


PKMYT1 


-3.4 


3.7S 


i o.e 


> retinoblastoma-like 2 
(p130) 


ATP-binding, 
Kinase, 

Serine/threonine- 
protein kinase, 

Tra n ^fp race 
i i caj ioici doc , 

Transcription 
regulation, DNA- 
binding, Nuclear 
protein, Cell cycle, 
Phosphorylation, 
Anti-oncoaene 


137 


Contig48913_RC 




-3.1 


1.72 


0.55 


Homo sapiens, Similar to hypothetical 
protein PR01722, clone MGC:15692 
IMAGE:3351479, mRNA, complete cds 


327 


NM_005804 


DDXL 


-2.5 


1.42 


0.58 


DEAD/H (Asp-Glu- 
Ala-Asp/His) box 
polypeptide 39 


ATP-binding, 
Helicase, 
Hydrolase, 
Hypothetical 

nrntein 

yjl w LCII 1 


191 


NM_016359 


LOC5120 
3 


-1.7 


3.6 


0.57 


nucleolar protein 
ANKT 


Hypothetical 
protein, Nuclear 
d rote in 


253 


NM_001645 


APOC1 


-2.9 


3.43 


0.58 


apolipoprotein C-l 


Plasma, Lipid 
transport, VLDL, 
Sianai 3D- 
structure, 
Polymorphism 


79 


Contig37895_RC 




-2 


2.05 


0.55 


ESTs 




295 


NM_005749 


TOB1 


-1.3 


4.96 


0.59 


LI CI 1 lOUUuUl \J I 

ERBB2, 1 


ri luspnoryiaiion 


109 


NM_000269 


NME1 


-1.3 


2.98 


0.55 


non-metastatic cells 
1, protein (NM23A) 
expressed in 


Transferase, 
Kinase, ATP- I 
binding, Nuclear 
protein, Anti- 
oncogene, Disease 
mutation 


39 


NM_014462 


LSM1 


-1 


4.5 


0.57 


Lsm1 protein 


Nuclear protein, 
Ribonucleoprotein, 
mRNA splicing, 
mRNA processing, 
RNA-binding 


245 


Contig31221_RC 




-1.4 


3.83 


0.56 


HTPAP protein 




287 


NM_005326 


HAGH 


-1.9 


4.29 


0.57 


hydroxyacyl 

glutathione 

hydrolase 


Hydrolase, Zinc, 
3D-structure 


179 


Contig42342_RC 




0.78 


-3.2 


-0.6 


Homo sapiens cDNA 
FLJ39417fis, clone 
PLACE6016942 


Hypothetical 
protein 


311 


AL1 37540 


NTN4 


2.24 


-3.9 


-0.6 


netrin 4 


Laminin EGF-like 
domain, Signal, 
Hypothetical 
arotein 


19 


Contig40434_RC 




1.64 


-5.6 


-0.6 i 
1 


wingless-type MMTV 
ntegration site 
f amily, member 5A 


Developmental 
Drotein, 
Slycoprotein, 
Signal 


301 
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Accession/ 
Contig No. 


Gene 


Avg 
good 
xdev 


Avg 
poor 
xdev 


Correl- 
ation 


Description 


Sp xref keyword 
Jist 


SEQ ID 


Contig 1632_RC 




1.03 


-3.9 


-0.6 


hypothetical protein 
MGC17921 


Hypothetical 
protein 


275 


NM_0 14246 


CELSR1 


0.95 


-4.6 


-0.6 


cadherin, EGF LAG 
seven-pass G-type 
receptor 1 (flamingo 
homolog, 
Drosophila) 


G-protein coupled 

receptor, 

Transmembrane, 

Glycoprotein, EGF- 

like domain, 

Calcium-binding, 

Laminin EGF-like 

domain, Repeat, 

Develoomental 

protein, 

Hydroxylation, 

Signal, Alternative 

splicing, 

Hypothetical 

protein 


237 


NM_005139 


ANXA3 


1.26 


-6.2 


-0.6 


annexin A3 


Annexin, 

Calcium/phospholi 

pid-binding, 

Repeat, 

Phospholipase A2 
inhibitor, 3D- 
structure, 
Poiymorphism 


171 



Table 5. Geneset of 65 markers used to classify ER+, ER/AGE low, LN individuals. 



Accession/ 
Contig No. 


Gene 


Avg 

good 

xdev 


Avg 
poor 
xdev 


Correl- 
ation 


Sequence 
name 


Description 


Sp xref keywo 
rdjist 


SEQ ID 


M55914 


MPB1 


-2.82 


1.25 


0.5 


ENOI 


enolase 1, (alpha) 


DNA-binding, 

Transcription 

regulation, 

Repressor, 

Nuclear protein, 

Lyase, 

Glycolysis, 

Magnesium, 

Multigene 

family, 

Hypothetical 

protein 


31 


NMJ305945 


MPB1 


-3.06 


1.19 


0.49 


ENQ1 


Homo sapiens 
enolase 1, (alpha) 
(ENQ1), mRNA. 


Glycolysis, 

Hypothetical 

protein, Lyase, 

Magnesium, 

DNA-binding, 

Transcription 

regulation, 

Repressor, 

Nuclear protein, 

Multigene family 


193 
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Accession/ 
Contig No. 


Gene 


Avg 
good 
xdev 


Avg 
poor 
xdev 


Correl 
ation 


- Sequence 
name 


Description 


Sp xref keywc 
rdjist 


> SEQID 


NM_001428 


EN01 


-2.5c 


5 1.16 


5 0.46 


> ENOI 


enolase 1, (alpha) 


DNA-binding, 

Transcription 

regulation, 

Repressor, 

Nuclear protein, 

Lyase, 

Glycolysis, 

Magnesium, 

Multigene 

family, 

Hypothetical 

protein 


71 


NM_001216 


CA9 


-4.72 


1.49 


0.6 


CA9 


carbonic 
anhydrase IX 


Lyase, Zinc, 
Transmembrane 
, Glycoprotein, 
Antigen, Signal, 
Nuclear protein, 
Polymorphism 


65 


NM_001124 


ADM 


-5.68 


2.99 


0.56 


ADM 


Adrenomedullin 


Hormone, 
Amidation, 
Cleavage on 
pair of basic 
residues, Signal 


61 


NM_000584 


IL8 


-2.45 


2.04 


0.54 


IL8 


interleukin 8 


Cytokine, 

Chemotaxis, 

Inflammatory 

response, 

Signal, 

Alternative 

splicing, 3D- 

structure 


49 


D25328 


PFKP 


-4.19 


3.29 


0.56 


PFKP 


Phosphofructo- 
kinase, platelet 


<inase, 
""ransferase, 
Glycolysis, 
Repeat, 
Allosteric 
enzyme, 
Phosphorylation, 
Magnesium, 
Multigene family 


25 


NMJD06096 


NDRG1 


-5.45 


5.97 


0.77 


NDRG1 


N-myc downstream 
regulated gene 1 


Hypothetical 
protein, Nuclear 
Drotein, Repeat 


199 


NM_004994 


MMP9 


-5.53 


1.07 


0.49 


MMP9 

< 
< 
( 


matrix 

metalloproteinase 

3 (gelatinase B, < 

32kDa gelatinase, ; 

92kDa type IV < 

:ollagenase) < 

( 

I 

r 
< 

I 


Hydrolase, 

Metalloprotease, 

Glycoprotein, 

Zinc, Zymogen, 

Calcium, 

Dollagen 

degradation, 

Extracellular 

natrix, Repeat, 

Signal, 

3 olymorphism, 
*D-structure 


165 
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Accession/ 
Contig No. 


, Gene 


Avg 
good 
xdev 


Avg 
poor 
xdev 


Correl 
ation 


- Sequence 
name 


Description 


Sp xref keywc 
rdjist 


i SEQID 


NM_003311 


TSSC3 


-4.57 


' 5.5£ 


! 0.66 


5 TSSC3 


tumor suppressing 
candidate 3 


subtransferable 


117 


NMJ306086 


TUBB4 


-5.1S 


I 2.8E 


i 0.5S 


) TUBB4 


tubulin, beta, 4 


G-protein 

coupled 

receptor, 

Transmembrane 

, Glycoprotein, 

Phosphorylation, 

Lipoprotein, 

Paimitate, 

Polymorphism, 

Hypothetical 

protein, GTP- 

binding, 

Receptor, 

Microtubules, 

Multigene family 


197 


NM_006115 


PRAME 


-4.48 


2.77 


0.61 


PRAME 


preferentially 
expressed antigen 
in melanoma 


Antigen 


201 


NM_004345 


CAMP 


-2.02 


1.37 


0.49 


CAMP 


cathelicidin 

antimicrobial 

peptide 


Antibiotic, Signa 


149 


NM_018455 


BM039 


-2.34 


0.76 


0.47 


BM039 


uncharacterized 
bone marrow 
protein BM039 




265 


Contig49169_RC 




-1.17 


1.5 


0.46 


SUV39H2 


suppressor of 
variegation 3-9 
Prosophila) 
lomolog 2; 
hypothetical protein 
FLJ23414 


Hypothetical 
protein, Nuclear 
Drotein 


329 


Contig45032_RC 




-1.37 


0.77 


0.45 


FLJ14813 


hypothetical protein 
FLJ14813 


Hypothetical 
Drotein, ATP- 
binding, Kinase, 
Serine/threonine 
-protein kinase, 
Transferase 


319 


NM_000917 


P4HA1 


-1.54 


4.31 


0.62 


P4HA1 


procollagen- 
Droline, 2- 
oxoglutarate 4- 
dioxygenase 
proline 4- 

ydroxylase), alpha \ 
Dolypeptide I < 
J 
i 


Dioxygenase, 

Collagen, 

Oxidoreductase, 

ron, Vitamin C, 

Alternative 

splicing, 

Glycoprotein, 

Endoplasmic 

-eticulum, Signal 


57 


NM_002046 ( 


3APD 


-2.51 


3.42 


0.6 ( 


3APD < 


3 yceraldehyde-3- ( 
Dhosphate ( 
iehydrogenase I 


glycolysis, NAD, 
Dxidoreductase, 
Hypothetical 
Drotein, 

Multigene family 


87 
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Accession/ 
Contig No. 


Gene 


Avg 

good 

xdev 


Avg 
poor 
xdev 


Correl- 
ation 


Sequence 
name 


Description 


Sp_xref_keywo 
rd_list 


SEQ ID 


NM_000365 


TPI1 


-1.81 


2.94 


0.56 


TPI1 


triosephosphate 
isomerase 1 


Fatty acid 

biosynthesis, 

Gluconeogenesi 

s Glvcolvsis 

Isomerase, 

Pentose shunt, 

Disease 

mutation 

Polymorphism, 

3D-structure 


45 


NM_014364 


GAPDS 


-1.08 


2.88 


0.58 


GAPDS 


glyceraldehyde-3- 
phosphate 
dehydrogenase, 
testis-specific 


Glycolysis, 

Oxidoreductase, 

NAD 


243 


NM_005566 


LDHA 


-2.01 


4.01 


0.59 


LDHA 


lactate 

dehydrogenase A 


Oxidoreductase, 
NAD, Glycolysis, 
Multigene 
family, Disease 
mutation, 
Polymorphism 


185 


NM_000291 


PGK1 


-2.28 


1.68 


0.51 


PGK1 


phosphoglycerate 
kinase 1 


Kinase, 

Transferase, 

Multigene 

family, 

Glycolysis, 

Anptvlation 

nwciy mil \j ii, 

Disease 

mutation, 

Polymorph ism, 

Hereditary 

hemolytic 

anemia 


41 


NM_016185 


LOC51 1 
55 


-2.33 


2.82 


0.59 


HN1 


hematological and 
neurological 
expressed 1 




251 


NM_001168 


BIRC5 


-4.33 


2.78 


0.55 


BIRC5 


baculoviral IAP 
repeat-containing 5 
(survivin) 


Apoptosis, Thiol 

protease 

inhibitor, 

Alternative 

splicing, 3D- 

structure, 

Hypothetical 

protein, 

Protease, 

Receptor 


63 


NM_002266 


KPNA2 


-3.75 


1.34 


0.47 


KPNA2 


karyopherin alpha 
2 (RAG cohort 1, 
importin alpha 1) 


Transport, 

Protein 

transport, 

Repeat, Nuclear 

protein, 

Polymorphism 


95 


Contig31288_RC 




-2.1 


1.27 


0.5 




ESTs, Weakly similar to hypothetical 
protein FLJ20489 [Homo sapiens] 
[H. sapiens] 


289 
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Accession/ 
Contig No. 



INM 000269 



Gene 



NME1 



Avg 
good 
xdev 



-2.1 



Avg 
poor 
xdev 



3.43 



Corre 
ation 



0.55 



Sequence 
name 



NME1 



Description 



| Sp_xref_keywo| SEQ ID 
rd list 



non-metastatic 
cells 1 , protein 
(NM23A) 
expressed in 



Transferase, 
Kinase, ATP- 
binding, Nuclear | 
protein, Anti- 
pncogene, 
Disease 
mutation 



NM 003158 



STK6 



-1.23 



1.73 



0.45 



STK6 



serine/threonine 
kinase 6 



ATP-binding, 
Kinase, 

(Serine/threonine | 
-protein kinase, 
(Transferase 



INM 007274 



HBACH 



-1.83 



2.73 



0.51 



BACH 



brain acyl-CoA 
hydrolase 



Contig55188_RC 



-2.36 



3.28 



0.47 



FLJ22341 



Hydrolase, 
Serine esterase, 
Repeat 



hypothetical protein I Hypothetical 
FLJ22341 protein 



NM 002061 



GCLM 



-1.06 



1.76 



0.48 



GCLM 



glutamate-cysteine [Ligase, 
ligase, modifier Glutathione 
subunit biosynthesis 



NM 004207 



SLC16A 



-3.11 



5.07 



0.67 



SLC16A3 



NM 000582 



SPP1 



-5.09 



5.47 



0.53 



SPP1 



solute carrier familylTransport, 
16 (monocarboxylic Symport, 
acid transporters), Transmembrane! 
member 3 |, Multigene 

family 



secreted 
phosphoprotein 1 
(osteopontin, bone 
sialoprotein I, early 
~ lymphocyte 
activation 1) 



Hypothetical 
protein, 
Glycoprotein, 
Sialic acid, 
Biomineralizatio 
n, Cell adhesion,! 
Phosphorylation,! 
Signal, 
(Alternative 
splicing 



NM 001109 



ADAM8 



-2.5 



3.74 



0.45 



ADAM 8 



disintegrin and 
metalloproteinase 
domain 8 



Hydrolase, 
Metalloprotease, | 
Zinc, Signal, 
Glycoprotein, 
(Transmembrane | 
Antigen 



D50402 



SLC11A 



1.05 



3.46 



0.53 



SLC11A1 



solute carrier family 
11 (proton-coupled 
divalent metal ion 
transporters), 
member 1 



KL080235 



DKFZP5 
86E162 
1 



1.23 



1.96 



0.51 



RIS1 



Ras-induced 
senescence 1 



Transport, Iron 
transport, 
Transmembrane | 
, Glycoprotein, 
Macrophage, 
Polymorphism 



Hypothetical 
protein 



Contig40552_RC 



Contig52490_RC 



INM 006461 



1.26 



3.96 



0.54 



FLJ25348 



0.64 



3.33 



0.61 



OC11623 



ypothetical protein (Hypothetical 
LJ25348 [protein 



DEEPE 
ST 



-2.1 



hypothetical protein | 
BC014072 



1.85 



0.46 



SPAG5 



sperm associated 
ntigen 5 



Hypothetical 
(protein 
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Accession/ 
Contig No. 


Gene 


Avg 
good 
xdev 


Avg 
poor 
xdev 


Correl- 
ation 


- Sequence 
name 


Description 


Sp xref keywo 
rdjist 


SEQ ID 


Contig56503_RC 




-4.2 


\ 3.3S 


> 0.5£ 


> MGC9753 


hypothetical gene 
MGC9753 


Hypothetical 
protein 


355 


Contig63525 




-1.91 


3.34 


0.£ 


FLJ 13352 


hypothetical protein 
FLJ 13352 


Hypothetical 
protein 


363 


NM_001909 


CTSD 


-0.83 


4.6 


0.51 


CTSD 


cathepsin D 
(lysosomal aspartyl 
protease) 


Hydrolase, 

Aspartyl 

protease, 

Glycoprotein, 

Lysosome, 

Signal, 

Zymogen, 

Polymorphism, 

Alzheimer's 

disease, 3D- 

structure 


83 


NM_005063 


SCD 


-2.57 


5.15 


0.48 


SCD 


stearoyl-CoA 
desaturase (delta- 
9-desaturase) 


Hypothetical 
protein, 
Endoplasmic 
reticulum, Fatty 
acid 

biosynthesis, 
Iron, 

Oxidoreductase, 
Transmembrane 


167 


NM_005165 


ALDOC 


-2.43 


5.02 


0.48 


ALDOC 


aldolase C, 

fructose- 

bisphosphate 


Lyase, Schiff 
base, 

Glycolysis, 
Multigene family 


173 


NM_000363 


TNNI3 


-0.54 


3.58 


0.48 


TNNI3 


troponin I, cardiac 


Hypothetical 
protein, Muscle 
Drotein, Actin- 
binding, 
Acetylation, 
Disease 
mutation, 
Cardiomyopathy 
, Receptor, 
Signal | 


43 


AF035284 




-1.63 


3.28 


0.47 


FADS1 


EST 


Heme, 

Hypothetical 

protein 


7 


Contig30875_RC 




-0.88 


3 


0.6 




ESTs 




285 


NM_018487 


HCA112 


-0.7 


3.54 


0.58 


HCA112 


hepatocellular 
carcinoma- 
associated antigen 
112 


Hypothetical 
orotein 


269 


NM_001323 ( 


3ST6 


-1.63 


3.84 


0.57 ( 


CST6 ( 


nystatin E/M 


Fhiol protease 
nhibitor, Signal, 
Slycoprotein 


69 
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Accession/ 
Contig No. 


Gene 


Avg 
good 
xdev 


Avg 
poor 
xdev 


Correl 
ation 


- Sequence 
name 


Description 


Sp_xref keywc 
rdjist 


• ISEQ ID 


NM_006516 


SLC2A1 


-1.6€ 


3 2.2^ 


> 0.46 


5 SLC2A1 


solute carrier family 
2 (facilitated 
glucose 
transporter), 
member 1 


f Transmembrane 
, Sugar 
transport, 
Transport, 
Glycoprotein, 
Multigene 
family, Disease 
mutation 


209 

I 


NM_007267 


LAK-4P 


-1.04 


3.28 


0.61 


EVIN1 


expressed in 
activated T/LAK 
lymphocytes 


Hypothetical 
protein 


221 


NM_004710 


SYNGR 
2 


-0.84 


4.81 


0.56 


SYNGR2 


synaptogyrin 2 


Transmembrane 


I 157 


Contig63649_RC 




-1.34 


6.3 


0.75 




ESTs, Weakly simil 
chromosomal prote 
sapiens] [H.sapiens 


ar to 2004399A 
n [Homo 


361 


NM_003376 


VEGF 


-2.12 


2.42 


0.46 


VEGF 


vascular 

endothelial growth 
factor 


Hypothetical 
protein, Mitogen, 
Angiogenesis, 
Growth factor, 
Glycoprotein, 
Signal, Heparin- 
binding, 

Alternative j 
splicing, 
Multigene 
family, 3D- 
structure 


119 


NM_000799 


EPO 


-0.75 


4.01 


0.69 


EPO 


erythropoietin 


Erythrocyte j 
maturation, f 
Glycoprotein, 
Hormone, I 
Signal, I 
Pharmaceutical, I 
3D-structure 


55 


NM_006014 


DXS987 
9E 


-1.85 


3.44 


0.54 


DXS9879E 


DNA segment on ch 
(unique) 9879 expre 


romosome X 
ssed sequence I 


195 


NM_007183 


PKP3 


-0.91 


4.14 


0.48 


PKP3 


plakophilin 3 


Cell adhesion, I 
Cytoskeleton, 
Structural f 
protein, Nuclear I 
protein, Repeat 


219 


D13642 


SF3B3 


-0.65 


2.28 


0.48 


SF3B3 


splicing factor 3b, I 
subunit 3, 130kDa | 

r 

[ 
r 

r 


Hypothetical I 
Drotein, j 
Spliceosome, I. 
nRNA 
Drocessing, 
nRNA splicing, I 
siuclear protein ( 


23 


NM_003756 1 


EIF3S3 


-1.85 


2.19 


0.46 i 


EIF3S3 t 
t 
f 

£ 


sukaryotic I 
ranslation initiation F 
actor 3, subunit 3 t 
jamma, 40kDa 


nitiation factor, 
3 rotein 

)iosynthesis I 


129 
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Gene 


Avg 
good 
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Avg 
poor 
xdev 


Correl- 
ation 


Sequence 
name 


Description 


Sp xref keywo 
rdjist 


SEQ ID 


Contig47096_RC 




-0.41 


4.52 


0.54 


PFKFB4 


6-phosphofructo-2- 
kinase/fructose- 
2,6-biphosphatase 
4 


Kinase, 

Multifunctional 

enzyme, 

Transferase, 

Hydrolase, ATP- 

binding, 

Phosphorylation, 
Multigene family 


323 


NM_004209 


SYNGR 
3 


-0.31 


3.67 


0.53 


SYNGR3 


synaptogyrin 3 


Transmembrane 


I 141 


Contig3464_RC 




0.99 


-5.81 


-0.52 




ESTs 




277 


Contig31646_RC 




1.1 


-7.76 


-0.5 


COL14A1 


collagen, type XIV, 
alpha 1 (undulin) 


Extracellular 
matrix, 

Glycoprotein, 

Hypothetical 

protein, 

Collagen, Signal 


! 291 


Contig49388_RC 




1.73 


-1.75 


-0.51 


FLJ 13322 


hypothetical protein 
FLJ 13322 


Hypothetical 
protein 


331 


Contig41887_RC 




0.37 


-5.74 


-0.47 


LOC 12422 
0 


similar to common 
salivary protein 1 


Hypothetical 
protein 


309 



[00104] 



5.4 DIAGNOSTIC AND PROGNOSTIC METHODS 
5.4.1 SAMPLE COLLECTION 
[00105] In the present invention, markers, such as target polynucleotide molecules or 
proteins, are extracted from a sample taken from an individual afflicted with a condition such 
as breast cancer. The sample may be collected in any clinically acceptable manner, but must 
be collected such that marker-derived polynucleotides (i.e., RNA) are preserved (if gene 
expression is to be measured) or proteins are preserved (if encoded proteins are to be 
measured). For example, mRNA or nucleic acids derived therefrom (i.e., cDNA or amplified 
DNA) are preferably labeled distinguishably from standard or control polynucleotide 
molecules, and both are simultaneously or independently hybridized to a microarray 
comprising some or all of the markers or marker sets or subsets described above. 
Alternatively, mRNA or nucleic acids derived therefrom may be labeled with the same label 
as the standard or control polynucleotide molecules, wherein the intensity of hybridization of 
each at a particular probe is compared. A sample may comprise any clinically relevant tissue 
sample, such as a tumor biopsy or fine needle aspirate, or a sample of bodily fluid, such as 
blood, plasma, serum, lymph, ascitic fluid, cystic fluid, urine or nipple exudate. The sample 
may be taken from a human, or, in a veterinary context, from non-human animals such as 
ruminants, horses, swine or sheep, or from domestic companion animals such as felines and 
canines. 

.. 48 



WO 2005/086891 



PCT/US2005/007894 



[00106] Methods for preparing total and poly(A)+ RNA are well known and are described 
generally in Sambrook et a/., Molecular Cloning - A Laboratory Manual (2nd Ed.), 
Vols. l-3 ? Cold Spring Harbor Laboratory, Cold Spring Harbor, New York (1989)) and 
Ausubel et al, Current Protocols in Molecular Biology, vol. 2, Current Protocols 
Publishing, New York (1994)). 

[00107] RNA may be isolated from eukaryotic cells by procedures that involve lysis of the 
cells and denaturation of the proteins contained therein. Cells of interest include wild-type 
cells (i.e., non-cancerous), drug-exposed wild-type cells, tumor- or tumor-derived cells, 
modified cells, normal or tumor cell line cells, and drug-exposed modified cells. Preferably, 
the cells are breast cancer tumor cells. 

[00108] Additional steps may be employed to remove DNA. Cell lysis may be 
accomplished with a nonionic detergent, followed by microcentrifugation to remove the 
nuclei and hence the bulk of the cellular DNA. In one embodiment, RNA is extracted from 
cells of the various types of interest using guanidinium thiocyanate lysis followed by CsCl 
centrifugation to separate the RNA from DNA (Chirgwin et ah, Biochemistry 1 8:5294-5299 
(1979)). Poly(A)+ RNA is selected by selection with oligo-dT cellulose (see Sambrook et 
al, Molecular Cloning - A Laboratory Manual (2nd Ed.), Vols. 1-3, Cold Spring 
Harbor Laboratory, Cold Spring Harbor, New York (1989). Alternatively, separation of 
RNA from DNA can be accomplished by organic extraction, for example, with hot phenol or 
phenol/chloroform/isoamyl alcohol. 

[00109] If desired, RNase inhibitors may be added to the lysis buffer. Likewise, for certain 
cell types, it may be desirable to add a protein denaturation/digestion step to the protocol. 
[001 10] For many applications, it is desirable to preferentially enrich mRNA with respect to 
other cellular RNAs, such as transfer RNA (tRNA) and ribosomal RNA (rRNA). Most 
mRNAs contain a poly(A) tail at their 3' end. This allows them to be enriched by affinity 
chromatography, for example, using oligo(dT) or poly(U) coupled to a solid support, such as 
cellulose or Sephadex™ (see Ausubel et al., Current Protocols in Molecular Biology, 
vol. 2, Current Protocols Publishing, New York (1994). Once bound, poly(A)+ mRNA is 
eluted from the affinity column using 2 mM EDTA/0.1% SDS. 

[001 1 1] The sample of RNA can comprise a plurality of different mRNA molecules, each 
different mRNA molecule having a different nucleotide sequence. In a specific embodiment, 
the mRNA molecules in the RNA sample comprise at least 5, 10, 15, 20, 25, 30, 40 or 50 
different nucleotide sequences. More preferably, the mRNA molecules of the RNA sample 
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comprise mRNA molecules corresponding to each of the marker genes. In another specific 
embodiment, the RNA sample is a mammalian RNA sample. 

[001 12] In a specific embodiment, total RNA or mRNA from cells are used in the methods of 
the invention. The source of the RNA can be cells of a plant or animal, human, mammal, 
primate, non-human animal, dog, cat, mouse, rat, bird, yeast, eukaryote, prokaryote, etc. In 
specific embodiments, the method of the invention is used with a sample containing total 
mRNA or total RNA from 1 x 10 6 cells or less. In another embodiment, proteins can be 
isolated from the foregoing sources, by methods known in the art, for use in expression 
analysis at the protein level. 

[001 13] Probes to the homologs of the marker sequences disclosed herein can be employed 
preferably when non-human nucleic acid is being assayed. 

[001 14] The methods of the invention may employ any molecule suitable as a marker. For 
example, sets of proteins informative for a particular condition, including a disease, may be 
determined. As for gene-based markers, levels of variations of different proteins in samples 
may be determined for phenotypic or genotypic subsets of the condition, and proteins 
showing significant variation in either level (abundance) or activity, or both, may be 
identified in order to create a set of proteins informative for one or more of these subsets. 
Such proteins may be identified, for example, by use of gel electrophoresis, such as one- 
dimensional polyacrylamide gel electrophoresis, two-dimensional polyacrylamide gel 
electrophoresis, nondenaturing polyacrylamide gel electrophoresis; isoelectric focusing gels, 
etc., by use of antibody arrays, etc. Of course, the particular template(s) used to classify the 
individual depends upon the type(s) of cellular constituents used as markers. For example, 
where nucleic acids (e.g., genes or nucleic acids derived from expressed genes) are used as 
markers, the template comprises nucleic acids (or the level of expression or abundance 
thereof); where proteins are used as markers, the template comprises proteins, for example, 
the level or abundance of those proteins in a set of individuals; etc. 

5.4.2 USE OF PROGNOSTIC GENESETS FOR BREAST CANCER 
[001 15] According to the present invention, once genesets informative for a plurality of 
subsets of a condition are identified, an individual is classified into one of these subsets and a 
prognosis is made based on the expression of the genes, or their encoded proteins, in the 
geneset for that subset in a breast cancer tumor sample taken from the individual. 
[001 16] For example, a particular hypothetical condition has three relevant phenotypic 
characteristics, A, B and C. In this example, based on these characteristics, genesets 
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informative for prognosis of four patient subsets A + B + ; A + B~C + ; A + B~C~; and A - are 
identified by the method described above. Thus, an individual having the condition would 
first be classified according to phenotypes A-C into one of the four patient subsets. In one 
embodiment, therefore, the invention provides for the classification of an individual having a 
condition into one of a plurality of patient subsets, wherein a set of genes informative for 
prognosis for the subset has been identified. A sample is then taken from the individual, and 
the expression of the prognostically-informative genes in the sample is analyzed and 
compared to a control. In various embodiments, the control is the average expression of 
informative genes in a pool of samples taken from good prognosis individuals classifiable 
into that patient subset; the average expression of informative genes in a pool of samples 
taken from poor prognosis individuals classifiable into that patient subset; a set of 
mathematical values that represent gene expression levels of good prognosis individuals 
classifiable into that patient subset; etc. 

[001 17] In another embodiment, a sample is taken from the individual, and the levels of 
expression of the prognostically-informative genes in the sample is analyzed. In one 
embodiment, the expression level of each gene can be compared to the expression level of the 
corresponding gene in a control of reference sample to determine a differential expression 
level. The expression profile comprising expression levels or differential expression levels of 
the plurality of genes is then compared to a template profile. In various embodiments, the 
template profile is a good prognosis template comprising the average expression of 
informative genes in samples taken from good prognosis individuals classifiable into that 
patient subset; or a poor prognosis template comprising the average expression of informative 
genes in samples taken from poor prognosis individuals classifiable into that patient subset; 
or a good prognosis profile comprising a set of mathematical values that represent gene 
expression levels of good prognosis individuals classifiable into that patient subset; etc. 
[001 18] In a specific embodiment, the condition is breast cancer, and the phenotypic, 
genotypic and/or clinical classes are: ER~, BRCA1 individuals; ER~, sporadic individuals; 
ER+, ER/AGE high individuals; ER+, ER/AGE low, LN+ individuals; and ER+, ER/AGE 
low, LN~ individuals. In this embodiment, an individual may be classified as ER+ or ER~. 
If the individual is ER~ the individual is additionally classified as having a BRCA2 -type or 
sporadic tumor. ER~ individuals are thus classified as ER~, BRCA1 or ER~, sporadic. 
Alternatively, if the individual is classified as ER+, the individual is additionally classified as 
having a high or low ratio of the log (ratio) of the level of expression of the gene encoding 
the estrogen receptor to the individual's age. Individuals having a low ratio are additionally 
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classified as LN+ or LN-. ER+ individuals are thus classified as ER+, ER/AGE high; ER+, 
ER/AGE low, LN+, or ER+, ER/AGE low, LN~. Of course, the individual's ER status, 
tumor type, age and LN status may be identified in any order, as long as the individual is 
classified into one of these five subsets. 

[001 19] Thus, in one embodiment, the invention provides a method of classifying an 
individual with a condition as having a good prognosis or a poor prognosis, comprising: (a) 
classifying said individual into one of a plurality of patient classes, said patient classes being 
differentiated by one or more phenotypic, genotypic or clinical characteristics of said 
condition; (b) determining the level of expression of a plurality of genes or their encoded 
proteins in a cell sample taken from the individual relative to a control, said plurality of genes 
or their encoded proteins comprising genes or their encoded proteins in a cell sample taken 
from the individual relative to a control, said plurality of genes or their encoded proteins 
comprising genes or their encoded proteins informative for prognosis of the patient class into 
which said individual is classified; and (c) classifying said individual as having a good 
prognosis or a poor prognosis on the basis of said level of expression. In a specific 
embodiment, said condition is breast cancer, said good prognosis is the non-occurrence of 
metastases within five years of initial diagnosis, and said poor prognosis is the occurrence of 
metastases within five years of initial diagnosis. In an more specific embodiment, said 
classifying said individual with a condition as having a good prognosis or a poor prognosis is 
carried out by comparing the level expression of each of said plurality of genes or their 
encoded proteins to said average level of expression of each corresponding gene or its 
encoded protein in said control, and classifying said individual as having a good prognosis 
poor prognosis if said level of expression correlates with said average level of expression of 
each of said genes or their encoded proteins in a good prognosis control or a poor prognosis 
control, respectively, more strongly than would be expected by chance. In a more specific 
embodiment of the method, said plurality of patient subsets comprises ER~, BRCA1 
individuals; ER - , sporadic individuals; ER+, ER/AGE high individuals; ER+, ER/AGE low, 
LN+ individuals; and ER+, ER/AGE low, LN~ individuals. In another embodiment, said 
control is the average level of expression of each of said plurality of genes informative for 
prognosis in a pool of tumor samples from individuals classified into said subset who have a 
good prognosis or good outcome, or who have a poor prognosis or good outcome. In another 
specific embodiment, said control is a set of mathematical values representing the average 
level of expression of genes informative for prognosis in tumor samples of individuals 
classifiable into said subset who have a good prognosis, or who have a poor prognosis. 
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[00120] It is evident that the different patient subsets described herein reflect different 
molecular mechanisms of the initiation of tumor formation and metastasis. Thus, the 
genesets listed in tables 1 -5 are also useful for diagnosing a person as having a particular type 
of breast cancer in the first instance. Thus, the invention also provides a method of 
diagnosing an individual as having a particular subtype of breast cancer, comprising 
determining the level of expression in a sample from said individual of a plurality of the 
genes for which markers are listed in Tables 1-5; and comparing said expression to a control, 
where said control is representative of the expression of said plurality of genes in a breast 
cancer sample of said subtype of cancer, and on the basis of said comparison, diagnosing the 
individual as having said subtype of breast cancer. In a specific embodiment, said subtype of 
cancer is selected from the group consisting of ER - , BRCA1 type; ER - , sporadic type; ER+, 
ER/AGE high type; ER+, ER/AGE low, LN+ type; and ER/AGE low, LN — type. In another 
specific embodiment, said control is the average level of expression of a plurality of the genes 
for which markers are listed in Table 1, Table 2, Table 3, Table 4 or Table 5. In another 
specific example, said comparing comprises determining the similarity of the expression of 
the genes for which markers are listed in each of Tables 1-5 in said sample taken from said 
individual to a control level of expression of the same genes for each of Tables 1-5, and 
determining whether the level of expression of said genes in said sample is most similar to 
said control expression of the genes for which markers are listed in Table 1, Table 2, Table 3, 
Table 4 or Table 5. 

[00121] In another embodiment, the invention provides a method of classifying an individual 
as having a good prognosis or a poor prognosis, comprising: (a) classifying said individual as 
ER - BRCA1; ER", sporadic; ER+, ER/AGE high; ER+, ER/AGE low, LN+; or ER+, 
ER/AGE low, LN~; (b) determining the level of expression of a first plurality of genes in a 
cell sample taken from the individual relative to a control, said first plurality of genes 
comprising two of the genes corresponding to the markers Table 1 if said individual is 
classified as ER - BRCA1; Table 2 if said individual is classified as ER - sporadic; Table 3 if 
said individual is classified as ER+, ER/AGE high; Table 4 if said individual is classified as 
ER+, ER/AGE low, LN+; or Table 5 if said individual is classified as ER+, ER/AGE low, 
LN~, wherein said individual is "ER/AGE high" if the ratio of ER expression to age exceeds 
a predetermined value, and "ER/AGE low" if the ratio of ER expression to age does not 
exceed said predetermined value. In a specific embodiment of this method, said 
predetermined value of ER calculated as ER = 0.1 (AGE - 42.5), wherein AGE is the age of 
said individual. In another specific embodiment, said individual is ER - , BRCA1, and said 
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plurality of genes comprises (i.e., contains at least) 1, 2, 3, 4, 5, 10 or all of the genes for 
which markers are listed in Table 1. In another specific embodiment, said individual is ER~ 
sporadic, and said plurality of genes comprises (i.e., contains at least) 1, 2, 3, 4, 5, 10 or all of 
the genes for which markers are listed in Table 2. In another specific embodiment, said 
individual is ER+, ER/AGE high, and said plurality of genes comprises (i.e., contains at least) 
1, 2, 3, 4, 5, 10 or all of the genes for which markers are listed in Table 3. In another specific 
embodiment, said individual is ER+, ER/AGE low, LN+, and said plurality of genes 
comprises (i.e., contains at least) 1, 2, 3, 4, 5, 10 or all of the genes for which markers are 
listed in Table 4. In another specific embodiment, said individual is ER+, ER/AGE low, 
LN~, and said plurality of genes comprises (i.e., contains at least) 1, 2, 3, 4, 5, 10 or all of the 
genes for which markers are listed in Table 5. In another specific embodiment, the method 
additionally comprises determining in said cell sample the level of expression, relative to a 
control, of a second plurality of genes for which markers are not found in Tables 1-5, wherein 
said second plurality of genes is informative for prognosis. 

[00122] In one embodiment, the invention provides a method of classifying an individual 
with a condition as having a good prognosis or a poor prognosis, comprising: (a) classifying 
said individual into one of a plurality of patient classes, said patient classes being 
differentiated by one or more phenotypic, genotypic or clinical characteristics of said 
condition; (b) determining the levels of expression of a plurality of genes or their encoded 
proteins in a cell sample taken from the individual, optionally relative to a control, said 
plurality of genes or their encoded proteins comprising genes or their encoded proteins 
informative for prognosis of the patient class into which said individual is classified; and (c) 
classifying said individual as having a good prognosis or a poor prognosis on the basis of said 
levels of expression. In a specific embodiment, said condition is breast cancer, said good 
prognosis is the non-occurrence of metastases within five years of initial diagnosis, and said 
poor prognosis is the occurrence of metastases within five years of initial diagnosis. In an 
more specific embodiment, said classifying said individual with a condition as having a good 
prognosis or a poor prognosis is carried out by comparing the patient's expression profile of 
said plurality of genes or their encoded proteins to a good and/or poor prognosis template 
profile of expression levels of said plurality of genes or their encoded proteins, and 
classifying said individual as having a good prognosis or poor prognosis if said patient 
expression profile has a high similarity to a good prognosis template or a poor prognosis 
template, respectively. In a more specific embodiment of the method, said plurality of patient 
subsets comprises ER~ BRCA1 individuals; ER~ sporadic individuals; ER+, ER/AGE high 
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individuals; ER+, ER/AGE low, LN+ individuals; and ER+, ER/AGE low, LN~ individuals. 
In another embodiment, said good prognosis template comprises the average level of 
expression of each of said plurality of genes informative for prognosis in tumor samples from 
individuals classified into said subset who have a good prognosis or good outcome, while 
said poor prognosis template comprises the average level of expression of each of said 
plurality of genes informative for prognosis in tumor samples from individuals classified into 
said subset who have a poor prognosis or poor outcome. In another specific embodiment, 
said good or poor prognosis template is a set of mathematical values representing the average 
level of expression of genes informative for prognosis in tumor samples of individuals 
classifiable into said subset who have a good prognosis, or who have a poor prognosis, 
respectively. 

[00123] It is evident that the different patient subsets described herein reflect different 
molecular mechanisms of the initiation of tumor formation and metastasis. Thus, the 
genesets listed in tables 1-5 are also useful for diagnosing a person as having a particular type 
of breast cancer in the first instance. Thus, the invention also provides a method of 
diagnosing an individual as having a particular subtype of breast cancer, comprising 
determining an expression profile of a plurality of the genes for which markers are listed in 
Tables 1-5 in a sample from said individual; and comparing said expression profile to a 
template profile, where said template is representative of the expression of said plurality of 
genes in a breast cancer sample of said subtype of cancer, and on the basis of said 
comparison, diagnosing the individual as having said subtype of breast cancer. In a specific 
embodiment, said subtype of cancer is selected from the group consisting of ER~, BRCA1 
type; ER~ sporadic type; ER+, ER/AGE high type; ER+, ER/AGE low, LN+ type; and 
ER/AGE low, LN~ type. In another specific embodiment, said template comprises the 
average levels of expression of a plurality of the genes for which markers are listed in Table 
1, Table 2, Table 3, Table 4 or Table 5. In another specific example, said comparing 
comprises determining the similarity of the expression profile of the genes for which markers 
are listed in each of Tables 1-5 in said sample taken from said individual to a template profile 
comprising levels of expression of the same genes for each of Tables 1-5, and determining 
whether the pattern of expression of said genes in said sample is most similar to the pattern of 
expression of the genes for which markers are listed in Table 1, Table 2, Table 3, Table 4 or 
Table 5. 

[00124] In another embodiment, the invention provides a method of classifying an individual 
as having a good prognosis or a poor prognosis, comprising: (a) classifying said individual as 
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ER~~, BRCA1; ER~, sporadic; ER+, ER/AGE high; ER+, ER/AGE low, LN+; or ER+, 
ER/AGE low, LN _ ; (b) determining an expression profile of a first plurality of genes in a cell 
sample taken from the individual relative to a control, said first plurality of genes comprising 
at least two of the genes corresponding to the markers Table 1 if said individual is classified 
as ER , BRCA1 ; Table 2 if said individual is classified as ER - , sporadic; Table 3 if said 
individual is classified as ER+, ER/AGE high; Table 4 if said individual is classified as ER+, 
ER/AGE low, LN+; or Table 5 if said individual is classified as ER+, ER/AGE low, LN~, 
wherein said individual is "ER/AGE high" if the ER level of the individual exceeds a 
predetermined value, and "ER/AGE low" if the ER level of the individual does not exceed 
said predetermined value. In a specific embodiment of this method, said predetermined value 
of ER is calculated as ER = 0. 1 (AGE - 42.5), wherein AGE is the age of said individual. In 
another specific embodiment, said individual is ER - , BRCA1, and said plurality of genes 
comprises at least 1, 2, 3, 4, 5, 10 or all of the genes for which markers are listed in Table 1. 
In another specific embodiment, said individual is ER~, sporadic, and said plurality of genes 
comprises at least 1, 2, 3, 4, 5, 10 or all of the genes for which markers are listed in Table 2. 
In another specific embodiment, said individual is ER+, ER/AGE high, and said plurality of 
genes comprises at least 1, 2, 3, 4, 5, 10 or all of the genes for which markers are listed in 
Table 3. In another specific embodiment, said individual is ER+, ER/AGE low, LN+, and 
said plurality of genes comprises at least 1, 2, 3, 4, 5, 10 or all of the genes for which markers 
are listed in Table 4. In another specific embodiment, said individual is ER+, ER/AGE low, 
LN~, and said plurality of genes comprises at least 1, 2, 3, 4, 5, 10 or all of the genes for 
which markers are listed in Table 5. In another specific embodiment, the method additionally 
comprises determining in said cell sample the level of expression, relative to a control, of a 
second plurality of genes for which markers are not found in Tables 1-5, wherein said second 
plurality of genes is informative for prognosis. 

[00125] Where information is available regarding the LN status of a breast cancer patient, the 
patient may be identified as having a "very good prognosis," an "intermediate prognosis," or 
a poor prognosis, which enables the refinement of treatment. In one embodiment, the 
invention provides a method of assigning a therapeutic regimen to a breast cancer patient, 
comprising: (a) classifying said patient as having a "poor prognosis," "intermediate 
prognosis," or "very good prognosis" on the basis of the levels of expression of at least five 
genes for which markers are listed in Table 1, Table 2, Table 3, Table 4 or Table 5; and (b) 
assigning said patient a therapeutic regimen, said therapeutic regimen (i) comprising no 
adjuvant chemotherapy if the patient is lymph node negative and is classified as having a 
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good prognosis or an intermediate prognosis, or (ii) comprising chemotherapy if said patient 
has any other combination of lymph node status and expression profile. 
[00126] In another embodiment, a breast cancer patient is assigned a prognosis by a method 
comprising (a) determining the breast cancer patient's age, ER status, LN status and tumor 
type; (b) classifying said patient as ER - sporadic; ER - BRCA1; ER+, ER/AGE high; ER+, 
ER/AGE low, LN+; or ER+, ER/AGE low, LN~; (c) determining an expression profile 
comprising at least five genes in a cell sample taken from said breast cancer patient wherein 
markers for said at least five genes are listed in Table 1 if said patient is classified as ER - , 
sporadic; Table 2 if said patient is classified as ER - , BRCA1; Table 3 if said patient is 
classified as ER+, ER/AGE high; Table 4 if said patient is classified as ER+, ER/AGE low, 
LN+; or Table 5 if said patient is classified as ER+, ER/AGE high, LN - ; (d) determining the 
similarity of the expression profile of said at least five genes to a template profile comprising 
levels of expression of said at least five genes to obtain a patient similarity value; (e) 
comparing said patient similarity value to selected first and second threshold values of 
similarity, respectively, wherein said second similarity threshold indicates greater similarity 
to said template expression profile than does said first similarity threshold; and (f) classifying 
said breast cancer patient as having a first prognosis if said patient similarity value exceeds 
said second threshold similarity values, a second prognosis if said patient similarity value 
exceeds said first threshold similarity value but does not exceed said second threshold 
similarity value, and a third prognosis if said patient similarity value does not exceed said 
first threshold similarity value. In a specific embodiment of the method, said first prognosis 
is a "very good prognosis," said second prognosis is an "intermediate prognosis," and said 
third prognosis is a "poor prognosis," wherein said breast cancer patient is assigned a 
therapeutic regimen comprising no adjuvant chemotherapy if the patient is lymph node 
negative and is classified as having a good prognosis or an intermediate prognosis, or 
comprising chemotherapy if said patient has any other combination of lymph node status and 
expression profile. 

[00127] The invention also provides a method of assigning a therapeutic regimen to a breast 
cancer patient, comprising: (a) determining the lymph node status for said patient; (b) 
determining the expression of at least five genes for which markers are listed in Table 5 in a 
cell sample from said patient, thereby generating an expression profile; (c) classifying said 
patient as having a "poor prognosis," "intermediate prognosis," or "very good prognosis" on 
the basis of said expression profile; and (d) assigning said patient a therapeutic regimen, said 
therapeutic regimen comprising no adjuvant chemotherapy if the patient is lymph node 
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negative and is classified as having a good prognosis or an intermediate prognosis, or 
comprising chemotherapy if said patient has any other combination of lymph node status and 
classification. In a specific embodiment of this method, said therapeutic regimen assigned to 
lymph node negative patients classified as having an "intermediate prognosis" additionally 
comprises adjuvant hormonal therapy. In another specific embodiment of this method, said 
classifying step (c) is carried out by a method comprising: (a) rank ordering in descending 
order a plurality of breast cancer tumor samples that compose a pool of breast cancer tumor 
samples by the degree of similarity between the expression profile of said at least five genes 
in each of said tumor samples and the expression profile of said at least five genes across all 
remaining tumor samples that compose said pool, said degree of similarity being expressed as 
a similarity value; (b) determining an acceptable number of false negatives in said classifying 
step, wherein a false negative is a breast cancer patient for whom the expression levels of said 
at least five genes in said cell sample predicts that said breast cancer patient will have no 
distant metastases within the first five years after initial diagnosis, but who has had a distant 
metastasis within the first five years after initial diagnosis; (c) determining a similarity value 
above which in said rank ordered list said acceptable number of tumor samples or fewer are 
false negatives; (d) selecting said similarity value determined in step (c) as a first threshold 
similarity value; (e) selecting a second similarity value, greater than said first similarity 
value, as a second threshold similarity value; and (f) determining the similarity between the 
expression profile of said at least five genes in a breast cancer tumor sample from the breast 
cancer patient and the expression profile of said respective at least five genes in said pool, to 
obtain a patient similarity value, wherein if said patient similarity value equals or exceeds 
said second threshold similarity value, said patient is classified as having a "very good 
prognosis"; if said patient similarity value equals or exceeds said first threshold similarity 
value, but is less than said second threshold similarity value, said patient is classified as 
having an "intermediate prognosis"; and if said patient similarity value is less than said first 
threshold similarity value, said patient is classified as having a "poor prognosis." Another 
specific embodiment of this method comprises determining the estrogen receptor (ER) status 
of said patient, wherein if said patient is ER positive and lymph node negative, said 
therapeutic regimen assigned to said patient additionally comprises adjuvant hormonal 
therapy. 

[00128] A patient in any patient subset or clinical class, e.g., any one of the classes described 
above, can be classified as having a particular prognosis level, e.g., a good prognosis or a 
poor prognosis, based on the similarity of the patient's cellular constituent profile to an 
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appropriate template profile for the prognosis level of patients in the clinical class. In one 
embodiment, a cellular constituent profile corresponding to a certain prognosis level, e.g., a 
profile comprising measurements of the plurality of cellular constituents representative of 
levels of the cellular constituents in a plurality of patients having the prognosis level is used 
as a template for the prognosis level. For example, a good prognosis template profile 
comprising measurements of the plurality of cellular constituents representative of levels of 
the cellular constituents in a plurality of good outcome patients or a poor prognosis template 
profile comprising measurements of the plurality of cellular constituents representative of 
levels of the cellular constituents in a plurality of poor outcome patients, can be used for 
determining whether a patient have good or poor prognosis. Here, a good outcome patient is 
a patient who has non-reoccurrence of metastases within a period of time after initial 
diagnosis, e.g., a period of 1, 2, 3, 4, 5 or 10 years. In contrast, a poor outcome patient is a 
patient who has reoccurrence of metastases within a period of time after initial diagnosis, e.g., 
a period of 1, 2, 3, 4, 5 or 10 years. In a preferred embodiment, both periods are 10 years. 
Tables 1-5 show exemplary template profiles for the respective patient classes. For example, 
the expression profile of a patient with a combination of ER+, ER/AGE low, LN+ can be 
compared with the good prognosis template of Table 4 to determine if the patient has good 
prognosis or poor prognosis. 

[00129] The degree of similarity of the patient's cellular constituent profile to a template of a 
particular prognosis can be used to indicate whether the patient has the particular prognosis. 
For example, a high degree of similarity indicates that the patient has the particular prognosis, 
whereas a low degree of similarity indicates that the patient does not have the particular 
prognosis. In a preferred embodiment, a patient is classified as having a good prognosis 
profile if the patient's cellular constituent profile has a high similarity to a good prognosis 
template and/or has a low similarity to a poor prognosis template. In another embodiment, a 
patient is classified as having a poor prognosis profile if the patient's cellular constituent 
profile has a low similarity to a good prognosis template and/or has a high similarity to a poor 
prognosis template. In embodiments for predicting the responsiveness of a breast cancer 
patient under the age of 55, the patients in the good and poor outcome patient populations 
used to generate the templates are preferably also under the age of 55 at the time of diagnosis 
of breast cancer. 

[00130] The degree of similarity between a patient's cellular constituent profile and a 
template profile can be determined using any method known in the art. In one embodiment, 
the similarity is represented by a correlation coefficient between the patient's profile and the 
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template. In one embodiment, a correlation coefficient above a correlation threshold 
indicates high similarity, whereas a correlation coefficient below the threshold indicates low 
similarity. In preferred embodiments, the correlation threshold is set as 0.3, 0.4, 0.5 or 0.6. 
In another embodiment, similarity between a patient's profile and a template is represented 
by a distance between the patient's profile and the template. In one embodiment, a distance 
below a given value indicates high similarity, whereas a distance equal to or greater than the 
given value indicates low similarity. 

[00131] As an illustration, in one embodiment, a template for a good prognosis is defined as 
z x (e.g., a profile consisting of the xdev's listed in the good prognosis column of one of 
Tables 1-5) and/or a template for poor prognosis is defined as 2 2 (e.g., a profile consisting of 
the xdev's listed in the poor prognosis column of one of Tables 1-5). Either one or both of 
the two classifier parameters (P x and P 2 ) can then be used to measure degrees of similarities 
between a patient's profile and the respective templates: P l measures the similarity between 
the patient's profile y and the good prognosis template z x , and P 2 measures the similarity 
between y and the poor prognosis template z 2 . In embodiments which employ correlation 
coefficients, the correlation coefficient P t can be calculated as: 

p t =&*mmyi) (4) 

where i = 1 and 2. 

[00132] Thus, in one embodiment, y is classified as a good prognosis profile if P 1 is greater 
than a selected correlation threshold or if P 2 is equal to or less than a selected correlation 
threshold. In another embodiment, y is classified as a poor prognosis profile if P x is less 
than a selected correlation threshold or if P 2 is above a selected correlation threshold. In still 
another embodiment, y is classified as a good prognosis profile if P x is greater than a first 
selected correlation threshold and y is classified as a poor prognosis profile if P 2 is greater 
than a second selected correlation threshold. 

[00133] In a preferred embodiment, the cellular constituent profile is an expression profile 
comprising measurements of a plurality of transcripts (e.g., measured as mRNAs or cDNAs) 
in a sample derived from a patient, e.g., the plurality of transcripts corresponding to the 
markers in all or a portion of one of Tables 1-5. In this embodiment, the good prognosis 
template can be a good prognosis expression template comprising measurements of the 
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plurality of transcripts representative of expression levels of the transcripts in a plurality of 
good prognosis patients, and the poor prognosis template can be a poor prognosis expression 
template comprising measurements of the plurality of transcripts representative of expression 
levels of the transcripts in a plurality of poor prognosis patients. In a preferred embodiment, 
measurement of each transcript in the good or poor prognosis expression template is an 
average of expression levels of the transcript in the plurality of good or poor prognosis 
patients, respectively. 

[00134] In another embodiment, the expression profile is a differential expression profile 
comprising differential measurements of the plurality of transcripts in a sample derived from 
the patient versus measurements of the plurality of transcripts in a control sample. The 
differential measurements can be xdev, log(ratio), error- weighted log(ratio), or a mean 
subtracted log(intensity) (see, e.g., Stoughton et al., PCT publication WO 00/39339, 
published on July 6, 2000; U.S. Patent Application No. 10/848,755, filed May 18, 2004, by 
Mao et al., attorney docket no: 9301-188-999, each of which is incorporated herein by 
reference in its entirety). 

5.4.3 IMPROVING SENSITIVITY TO EXPRESSION LEVEL 
DIFFERENCES 

[00135] In using the markers disclosed herein, and, indeed, using any sets of markers, e.g., to 
compare profiles or to differentiate an individual having one phenotype from another 
individual having a second phenotype, one can compare the profile comprising absolute 
expression levels of the markers in a sample to a template; for example, a template 
comprising the average levels of expression of the markers in a plurality of individuals. To 
increase the sensitivity of the comparison, however, the expression level values are preferably 
transformed in a number of ways. Also, to differentiate an individual having one phenotype 
from another individual having a second phenotype using any sets of markers, one can 
compare the absolute expression of each of the markers in a sample to a control; for example, 
the control can be the average level of expression of each of the markers, respectively, in a 
pool of individuals. 

[00136] For example, the expression level of each of the markers can be normalized by the 
average expression level of all markers the expression level of which is determined, or by the 
average expression level of a set of control genes. Thus, in one embodiment, the markers are 
represented by probes on a microarray, and the expression level of each of the markers is 
normalized by the mean or median expression level across all of the genes represented on the 
microarray, including any non-marker genes. In a specific embodiment, the normalization is 
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carried out by dividing the median or mean level of expression of all of the genes on the 
microarray. In another embodiment, the expression levels of the markers is normalized by 
the mean or median level of expression of a set of control markers. In a specific 
embodiment, the control markers comprise a set of housekeeping genes. In another specific 
embodiment, the normalization is accomplished by dividing by the median or mean 
expression level of the control genes. 

[00137] The sensitivity of a marker-based assay will also be increased if the expression 
levels of individual markers are compared to the expression of the same markers in a pool of 
samples. Preferably, the comparison is to the mean or median expression level of each the 
marker genes in the pool of samples. Such a comparison may be accomplished, for example, 
by dividing by the mean or median expression level of the pool for each of the markers from 
the expression level each of the markers in the sample. This has the effect of accentuating the 
relative differences in expression between markers in the sample and markers in the pool as a 
whole, making comparisons more sensitive and more likely to produce meaningful results 
that the use of absolute expression levels alone. The expression level data may be 
transformed in any convenient way; preferably, the expression level data for all is log 
transformed before means or medians are taken. 

[00138] In performing comparisons to a pool, two approaches may be used. First, the 
expression levels of the markers in the sample may be compared to the expression level of 
those markers in the pool, where nucleic acid derived from the sample and nucleic acid 
derived from the pool are hybridized during the course of a single experiment. Such an 
approach requires that new pool nucleic acid be generated for each comparison or limited 
numbers of comparisons, and is therefore limited by the amount of nucleic acid available. 
Alternatively, and preferably, the expression levels in a pool, whether normalized and/or 
transformed or not, are stored on a computer, or on computer-readable media, to be used in 
comparisons to the individual expression level data from the sample (i.e., single-channel 
data). 

[00139] The current invention also provides the following method of classifying a first cell or 
organism as having one of at least two different phenotypes, where the different phenotypes 
comprise a first phenotype and a second phenotype. The level of expression of each of a 
plurality of markers in a first sample from the first cell or organism is compared to the level 
of expression of each of said markers, respectively, in a pooled sample from a plurality of 
cells or organisms, the plurality of cells or organisms comprising different cells or organisms 
exhibiting said at least two different phenotypes, respectively, to produce a first compared 
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value. The first compared value is then compared to a second compared value, wherein said 
second compared value is the product of a method comprising comparing the level of 
expression of each of said markers in a sample from a cell or organism characterized as 
having said first phenotype to the level of expression of each of said markers, respectively, in 
the pooled sample. The first compared value is then compared to a third compared value, 
wherein said third compared value is the product of a method comprising comparing the level 
of expression of each of the markers in a sample from a cell or organism characterized as 
having the second phenotype to the level of expression of each of the markers, respectively, 
in the pooled sample. In specific embodiments, the marker can be a gene, a protein encoded 
by the gene, etc. Optionally, the first compared value can be compared to additional 
compared values, respectively, where each additional compared value is the product of a 
method comprising comparing the level of expression of each of said markers in a sample 
from a cell or organism characterized as having a phenotype different from said first and 
second phenotypes but included among the at least two different phenotypes, to the level of 
expression of each of said genes, respectively, in said pooled sample. Finally, a 
determination is made as to which of said second, third, and, if present, one or more 
additional compared values, said first compared value is most similar, wherein the first cell or 
organism is determined to have the phenotype of the cell or organism used to produce said 
compared value most similar to said first compared value. 

[00140] The sensitivity of a marker-based assay will also be increased if the expression 
levels of individual markers are compared to the expression of the same markers in a control 
sample, e.g., a sample comprises a pool of samples, to generate a differential expression 
profile. Such a comparison may be accomplished, for example, by determining a ratio 
between expression level of each marker in the sample and the expression level of the 
corresponding marker in the control sample. This has the effect of accentuating the relative 
differences in expression between markers in the sample and markers in the control as a 
whole, making subsequent comparisons to a template more sensitive and more likely to 
produce meaningful results than the use of absolute expression levels alone. The comparison 
may be performed in any convenient way, e.g., by taking difference, ratio, or log(ratio). 
[00141] In performing comparisons to a control sample, two approaches may be used. First, 
the expression levels of the markers in the sample may be compared to the expression level of 
those markers in the control sample, where nucleic acid derived from the sample and nucleic 
acid derived from the control are hybridized during the course of a single experiment. Such 
an approach requires that new control sample of nucleic acid be generated for each 
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comparison or limited numbers of comparisons, and is therefore limited by the amount of 
nucleic acid available. Alternatively, the expression levels in a control sample, whether 
normalized and/or transformed or not, are stored on a computer, or on computer-readable 
media, to be used in comparisons to the individual expression level data from the sample (i.e., 
single-channel data). 

[00142] The methods of the invention preferably use a control or reference sample, which 
can be any suitable sample against which changes in cellular constituents can be determined. 
In one embodiment, the control or reference sample is generated by pooling together the 
plurality of cellular constituents, e.g., a plurality of transcripts or cDNAs, or a plurality of 
protein species, from a plurality of breast cancer patients. Alternatively, the control or 
reference sample can be generated by pooling together purified or synthesized cellular 
constituents, e.g., a plurality of purified or synthesized transcripts or cDNAs, a plurality of 
purified or synthesized protein species. In one embodiment, synthetic RNAs for each 
transcripts or cDNAs are pooled to form the control or reference sample. Preferably, the 
abundances of synthetic RNAs are approximately the abundances of the corresponding 
transcripts in a real tumor pool. The differential expression of marker genes for each 
individual patient sample is measured against this control sample. In one embodiment, 60- 
mer oligonucleotides corresponding to the probe sequences on a microarray used to assay the 
expression levels of the diagnostic/prognostic transcripts are synthesized and cloned into 
pBluescript SK- vector (Statagene, La Jolla, CA), adjacent to the T7 promotor sequence. 
Individual clones are isolated, and the sequences of their inserts are verified by DNA 
sequencing. To generate synthetic RNAs, clones are linearized with .EcoRI and a T7 in vitro 
transcription (IVT) reaction is performed by MegaScript kit (Ambion, Austin, TX), followed 
by DNase treatment of the product. Synthetic RNAs are purified on RNeasy columns 
(Qiagen, Valencia, CA). These synthetic RNAs are transcribed, amplified, labeled, and 
mixed together to make the reference pool. The abundance of those synthetic RNAs are 
chosen to approximate the abundances of the transcripts of the corresponding marker genes in 
the real tumor pool. 

[00143] The current invention provides the following method of classifying a first cell or 
organism as having one of at least two different phenotypes, where the different phenotypes 
comprise a first phenotype and a second phenotype. The level of expression of each of a 
plurality of markers in a first sample from the first cell or organism is compared to the level 
of expression of each of said markers, respectively, in a pooled sample from a plurality of 
cells or organisms, the plurality of cells or organisms comprising different cells or organisms 
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exhibiting said at least two different phenotypes, respectively, to produce a first compared 
value so that a first differential profile comprising a plurality of first compared values for said 
plurality of markers is generated. The first differential profile is then compared to a second 
differential profile comprising a plurality of second compared values, wherein each said 
second compared value is the product of a method comprising comparing the level of 
expression of each of said markers in a sample from a cell or organism characterized as 
having said first phenotype to the level of expression of each of said markers, respectively, in 
the pooled sample. The first differential profile is then compared to a third differential profile 
comprising a plurality of third compared values, wherein each said third compared value is 
the product of a method comprising comparing the level of expression of each of the markers 
in a sample from a cell or organism characterized as having the second phenotype to the level 
of expression of each of the markers, respectively, in the pooled sample. In specific 
embodiments, each marker can be a gene, a protein encoded by the gene, etc. Optionally, the 
first differential profile can be compared to additional expression profiles each of which 
comprising additional compared values, respectively, where each additional compared value 
is the product of a method comprising comparing the level of expression of each of said 
markers in a sample from a cell or organism characterized as having a phenotype different 
from said first and second phenotypes but included among the at least two different 
phenotypes, to the level of expression of each of said genes, respectively, in said pooled 
sample. Finally, a determination is made as to which of said second, third, and, if present, 
one or more additional differential profiles, said first differential profile is most similar, 
wherein the first cell or organism is determined to have the phenotype of the cell or organism 
used to produce said differential profile most similar to said first differential profile. 
[00144] In a specific embodiment of this method, the compared values are each ratios of the 
levels of expression of each of said genes. In another specific embodiment, each of the levels 
of expression of each of the genes in the pooled sample are normalized prior to any of the 
comparing steps. In a more specific embodiment, the normalization of the levels of 
expression is carried out by dividing by the median or mean level of the expression of each of 
the genes or dividing by the mean or median level of expression of one or more housekeeping 
genes in the pooled sample from said cell or organism. In another specific embodiment, the 
normalized levels of expression are subjected to a log transform, and the comparing steps 
comprise subtracting the log transform from the log of the levels of expression of each of the 
genes in the sample. In another specific embodiment, the two or more different phenotypes 
are different stages of a disease or disorder. In still another specific embodiment, the two or 
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more different phenotypes are different prognoses of a disease or disorder. In yet another 
specific embodiment, the levels of expression of each of the genes, respectively, in the pooled 
sample or said levels of expression of each of said genes in a sample from the cell or 
organism characterized as having the first phenotype, second phenotype, or said phenotype 
different from said first and second phenotypes, respectively, are stored on a computer or on 
a computer-readable medium. 

[00145] In another specific embodiment, the two phenotypes are good prognosis and poor 
prognosis. In a more specific embodiment, the two phenotypes are good prognosis and poor 
prognosis for an individual that is identified as having ER~ BRCA1 status, ER~~, sporadic 
status, ER+, ER/AGE high status, ER+, ER/AGE low, LN+ status, or ER+, ER/AGE low, 
LN+ status. 

[00146] In another specific embodiment, the comparison is made between the expression 
profile of the genes in the sample and the expression profile of the same genes in a pool 
representing only one of two or more phenotypes. In the context of prognosis-correlated 
genes, for example, one can compare the expression levels of prognosis-related genes in a 
sample to the average levels of the expression of the same genes in a plurality of "good 
prognosis" samples (as opposed to a plurality of samples that include samples from patients 
having poor prognoses and good prognoses). Thus, in this method, a sample is classified as 
having a good prognosis if the expression profile of prognosis-correlated genes exceeds a 
chosen coefficient of correlation to the average "good prognosis" expression profile {e.g., the 
profile comprising average levels of expression of prognosis-correlated genes in samples 
from a plurality of patients having a "good prognosis"). Patients whose expression profiles 
correlate more poorly with the "good prognosis" expression profile {e.g., whose correlation 
coefficient fails to exceed the chosen coefficient) are classified as having a poor prognosis. 
[00147] Where individuals are classified on the basis of phenotypic, genotypic, or clinical 
characteristics into patient subsets, the pool of samples may be a pool of samples for the 
phenotype that includes samples representing each of the patient subsets. Alternatively, the 
pool of samples may be a pool of samples for the phenotype representing only the specific 
patient subset. For example, where an individual is classified as ER+, sporadic, the pool of 
samples to which the individual's sample is compared may be a pool of samples from ER+, 
sporadic individuals having a good prognosis only, or may be a pool of samples of 
individuals having a good prognosis, without regard to ER status or mutation type. 
[00148] The method can be applied to a plurality of patient subsets. For example, in a 
specific embodiment, the phenotype is good prognosis, and the individual is classified into 

66 



WO 2005/086891 



PCT/US2005/007894 



one of the following patient subsets: ER~ BRCA1 status, ER~ sporadic status, ER-f, 
ER/AGE high status, ER+ 3 ER/AGE low, LN+ status, or ER+, ER/AGE low, LN+ status. A 
set of markers informative for prognosis for the patient subset into which the individual is 
classified is then used to determine the likely prognosis for the individual. A sample is 
classified as coming from an individual having a good prognosis if the expression profile of 
prognosis-correlated genes for the particular subset into which the individual is classified 
exceeds a chosen coefficient of correlation to the average "good prognosis" expression 
profile {e.g., the levels of expression of prognosis-correlated genes in a plurality of samples 
from patients within the subclass having a "good prognosis"). Patients whose expression 
levels correlate more poorly with the "good prognosis" expression profile (e.g., whose 
correlation coefficient fails to exceed the chosen coefficient) are classified as having a poor 
prognosis. 

[00149] Of course, single-channel data may also be used without specific comparison to a 
mathematical sample pool. For example, a sample may be classified as having a first or a 
second phenotype, wherein the first and second phenotypes are related, by calculating the 
similarity between the expression profile of at least 5 markers in the sample, where the 
markers are correlated with the first or second phenotype, to a first phenotype template and a 
second phenotype template each comprising the expression levels of the same markers, by (a) 
labeling nucleic acids derived from a sample with a fluorophore to obtain a pool of 
fluorophore-labeled nucleic acids; (b) contacting said fluorophore-labeled nucleic acid with a 
microarray under conditions such that hybridization can occur, detecting at each of a plurality 
of discrete loci on the microarray a fluorescent emission signal from said fluorophore-labeled 
nucleic acid that is bound to said microarray under said conditions; and (c) determining the 
similarity of marker gene expression in the individual sample to the first and second 
templates, wherein if said expression is more similar to the first template, the sample is 
classified as having the first phenotype, and if said expression is more similar to the second 
template, the sample is classified as having the second phenotype. 
[0100] In a specific embodiment of the above method, the first phenotype is a good 
prognosis of breast cancer, the sample is a sample from an individual that has been classified 
into a patient subset, and the first and second templates are templates for the phenotype for 
the particular patient subset. In a more specific embodiment, for example, the first phenotype 
is a good prognosis, the second phenotype is a poor prognosis, the patient is classified into an 
ER"~, sporadic patient subset, an ER~ BRCA1 subset, an ER+, ER/AGE high subset, an ER+, 
ER/AGE low, LN+ subset, or an ER+, ER/AGE low, LN+ subset, and said first and second 

67 



WO 2005/086891 



PCT/US2005/007894 



templates are templates derived from the expression of the marker genes in individuals 
having a good prognosis and a poor prognosis, respectively, wherein said individuals are all 
of the patient subset into which said patient is classified. 

5.5 DETERMINATION OF MARKER GENE EXPRESSION LEVELS 
5.5.1 METHODS 

[00150] The expression levels of the marker genes in a sample may be determined by any 
means known in the art. The expression level may be determined by isolating and 
determining the level (i.e., amount) of nucleic acid transcribed from each marker gene. 
Alternatively, or additionally, the level of specific proteins encoded by a marker gene may be 
determined. 

[00151] The level of expression of specific marker genes can be accomplished by 
determining the amount of mRNA, or polynucleotides derived therefrom, present in a sample. 
Any method for determining RNA levels can be used. For example, RNA is isolated from a 
sample and separated on an agarose gel. The separated RNA is then transferred to a solid 
support, such as a filter. Nucleic acid probes representing one or more markers are then 
hybridized to the filter by northern hybridization, and the amount of marker-derived RNA is 
determined. Such determination can be visual, or machine-aided, for example, by use of a 
densitometer. Another method of determining RNA levels is by use of a dot-blot or a slot- 
blot. In this method, RNA, or nucleic acid derived therefrom, from a sample is labeled. The 
RNA or nucleic acid derived therefrom is then hybridized to a filter containing 
oligonucleotides derived from one or more marker genes, wherein the oligonucleotides are 
placed upon the filter at discrete, easily-identifiable locations. Hybridization, or lack thereof, 
of the labeled RNA to the filter-bound oligonucleotides is determined visually or by 
densitometer. Polynucleotides can be labeled using a radiolabel or a fluorescent (i.e., visible) 
label. 

[00152] These examples are not intended to be limiting; other methods of determining RNA 
abundance are known in the art. 

[00153] The level of expression of particular marker genes may also be assessed by 
determining the level of the specific protein expressed from the marker genes. This can be 
accomplished, for example, by separation of proteins from a sample on a polyacrylamide gel, 
followed by identification of specific marker-derived proteins using antibodies in a western 
blot. Alternatively, proteins can be separated by two-dimensional gel electrophoresis 
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systems. Two-dimensional gel electrophoresis is well-known in the art and typically involves 
isoelectric focusing along a first dimension followed by SDS-PAGE electrophoresis along a 
second dimension. See, e.g., Hames et al, 1990, Gel Electrophoresis of Proteins: A 
Practical Approach, IRL Press, New York; Shevchenko et al, Proc. Nat 'I Acad. Sci. USA 
93:1440-1445 (1996); Sagliocco et ah, Yeast 12:1519-1533 (1996); Lander, Science 
274:536-539 (1996). The resulting electropherograms can be analyzed by numerous 
techniques, including mass spectrometric techniques, western blotting and immunoblot 
analysis using polyclonal and monoclonal antibodies. 

[00154] Alternatively, marker-derived protein levels can be determined by constructing an 
antibody microarray in which binding sites comprise immobilized, preferably monoclonal, 
antibodies specific to a plurality of protein species encoded by the cell genome. Preferably, 
antibodies are present for a substantial fraction of the marker-derived proteins of interest. 
Methods for making monoclonal antibodies are well known {see, e.g., Harlow and Lane, 
1988, Antibodies: A Laboratory Manual, Cold Spring Harbor, New York, which is 
incorporated in its entirety for all purposes). In one embodiment, monoclonal antibodies are 
raised against synthetic peptide fragments designed based on genomic sequence of the cell. 
With such an antibody array, proteins from the cell are contacted to the array, and their 
binding is assayed with assays known in the art. Generally, the expression, and the level of 
expression, of proteins of diagnostic or prognostic interest can be detected through 
immunohistochemical staining of tissue slices or sections. 

[00155] Finally, expression of marker genes in a number of tissue specimens may be 
characterized using a "tissue array" (Kononen et al, Nat. Med 4(7): 844-7 (1998)). In a tissue 
array, multiple tissue samples are assessed on the same microarray. The arrays allow in situ 
detection of RNA and protein levels; consecutive sections allow the analysis of multiple 
samples simultaneously. 

5.5.2 MICROARRAYS 
[00156] In preferred embodiments, polynucleotide microarrays are used to measure 
expression so that the expression status of each of the markers above is assessed 
simultaneously. Generally, microarrays according to the invention comprise a plurality of 
markers informative for prognosis, or outcome determination, for a particular disease or 
condition, and, in particular, for individuals having specific combinations of genotypic or 
phenotypic characteristics of the disease or condition (i.e., that are prognosis-informative for 
a particular patient subset). 
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[00157] The microarrays of the invention preferably comprise at least 2, 3, 4, 5, 7, 10, 15, 20, 
25, 30, 35, 40, 45, 50, 75, 100, 150, 200 or more of markers, or all of the markers, or any 
combination of markers, identified as prognosis-informative within a patient subset. The 
actual number of informative markers the microarray comprises will vary depending upon the 
particular condition of interest, the number of markers identified, and, optionally, the number 
of informative markers found to result in the least Type I error, Type II error, or Type I and 
Type II error in determination of prognosis. As used herein, "Type I error" means a false 
positive and "Type II error" means a false negative; in the example of prognosis of beast 
cancer, Type I error is the mis-characterization of an individual with a good prognosis as 
having a poor prognosis, and Type II error is the mis-characterization of an individual with a 
poor prognosis as having a good prognosis. 

[00158] In specific embodiments, the invention provides polynucleotide arrays in which the 
prognosis markers identified for a particular patient subset comprise at least 50%, 60%, 70%, 
80%, 85%, 90%, 95% or 98% of the probes on said array. In another specific embodiment, 
the microarray comprises a plurality of probes, wherein said plurality of probes comprise 
probes complementary and hybridizable to at least 75% of the prognosis-informative markers 
identified for a particular patient subset. Microarrays of the invention, of course, may 
comprise probes complementary and hybridizable to prognosis-informative markers for a 
plurality of the patient subsets, or for each patient subset, identified for a particular condition. 
In another embodiment, therefore, the microarray of the invention comprises a plurality of 
probes complementary and hybridizable to at least 75% of the prognosis-informative markers 
identified for each patient subset identified for the condition of interest, and wherein said 
probes, in total, are at least 50% of the probes on said microarray. 
[00159] In yet another specific embodiment, microarrays that are used in the methods 
disclosed herein optionally comprise markers additional to at least some of the markers 
identified by the methods disclosed elsewhere herein. For example, in a specific 
embodiment, the microarray is a screening or scanning array as described in Altschuler et ah, 
International Publication WO 02/18646, published March 7, 2002 and Scherer et ah, 
International Publication WO 02/16650, published February 28, 2002. The scanning and 
screening arrays comprise regularly-spaced, positionally-addressable probes derived from 
genomic nucleic acid sequence, both expressed and unexpressed. Such arrays may comprise 
probes corresponding to a subset of, or all of, the markers identified for the patient subset(s) 
for the condition of interest, and can be used to monitor marker expression in the same way 
as a microarray containing only prognosis-informative markers otherwise identified. 
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[00160] In yet another specific embodiment, the microarray is a commercially-available 
cDNA microarray that comprises at least five markers identified by the methods described 
herein. Preferably, a commercially-available cDNA microarray comprises all of the markers 
identified by the methods described herein as being informative for a patient subset for a 
particular condition. However, such a microarray may comprise at least 5, 10, 15 or 25 of 
such markers, up to the maximum number of markers identified. 
[00161] In an embodiment specific to breast cancer, the invention provides for 
oligonucleotide or cDNA arrays comprising probes hybridizable to the genes corresponding 
to each of the marker sets described above (i.e., markers informative for ER~, sporadic 
individuals, markers informative for ERT, BRCA1 individuals, markers informative for ER+, 
ER/AGE high individuals, markers informative for ER-h, ER/AGE low, LN+ individuals, and 
markers informative for ER+, ER/AGE low, LN~ individuals, as shown in Tables 1-5). Any 
of the microarrays described herein may be provided in a sealed container in a kit. 
[00162] The invention provides microarrays containing probes useful for the prognosis of 
any breast cancer patient, or for breast cancer patients classified into one of a plurality of 
patient subsets. In particular, the invention provides polynucleotide arrays comprising probes 
to a subset or subsets of at least 5, 10, 1 5, 20, 25 or more of the genetic markers, or up to the 
full set of markers, in any of Tables 1-5, which distinguish between patients with good and 
poor prognosis. In certain embodiments, therefore, the invention provides microarrays 
comprising probes for a plurality of the genes for which markers are listed in Tables 1, 2, 3, 4 
or 5. In a specific embodiment, the microarray of the invention comprises 1, 2, 3, 4, 5 or 10 
of the markers in Table 1, at least five of the markers in Table 2; 1, 2, 3, 4, 5 or 10 of the 
markers in Table 3; 1, 2, 3, 4, 5 or 10 of the markers in Table 4; or 1, 2, 3, 4, 5 or 10 of the 
markers in Table 1. In other embodiments, the microarray comprises probes for 1, 2, 3, 4, 5, 
or 10 of the markers shown in any two, three or four of Tables 1-5, or all of Tables 1-5. In 
other embodiments, the microarray of the invention contains each of the markers in Table 1, 
Table 2, Table 3, Table 4, or Table 5. In another embodiment, the microarray contains all of 
the markers shown in Tables 1-5. In specific embodiments, the array comprises probes 
derived only from the markers listed in Table 1, Table 2, Table 3, Table 4, or Table 5; probes 
derived from any two of Tables 1-5; any three of Tables 1-5; any four of Tables 1-5; or all of 
Tables 1-5. 

[00163] In other embodiments, the array comprises a plurality of probes derived from 
markers listed in any of Tables 1-5 in combination with a plurality of other probes, derived 
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from markers not listed in any of Tables 1-5, that are identified as informative for the 
prognosis of breast cancer. 

[00164] In specific embodiments, the invention provides polynucleotide arrays in which the 
breast cancer prognosis markers described herein in Tables 1 5 2, 3, 4 and/or 5 comprise at 
least 50%, 60%, 70%, 80%, 85%, 90%, 95% or 98% of the probes on said array. In another 
specific embodiment, the microarray comprises a plurality of probes, wherein said plurality 
of probes comprise probes complementary and hybridizable to at least 75% of the genes for 
which markers are listed in Table 1; probes complementary and hybridizable to at least 75% 
of the genes for which markers are listed in Table 2; probes complementary and hybridizable 
to at least 75% of the genes for which markers are listed in Table 3; probes complementary 
and hybridizable to at least 75% of the genes for which markers are listed in Table 4; and 
probes complementary and hybridizable to at least 75% of the genes for which markers are 
listed in Table 5, wherein said probes, in total, are at least 50% of the probes on said 
microarray. 

[00165] In yet another specific embodiment, microarrays that are used in the methods 
disclosed herein optionally comprise markers additional to at least some of the markers listed 
in Tables 1-5. For example, in a specific embodiment, the microarray is a screening or 
scanning array as described in Altschuler et ah, International Publication WO 02/18646, 
published March 7, 2002 and Scherer et ah, International Publication WO 02/16650, 
published February 28, 2002. The scanning and screening arrays comprise regularly-spaced, 
positionally-addressable probes derived from genomic nucleic acid sequence, both expressed 
and unexpressed. Such arrays may comprise probes corresponding to a subset of, or all of, 
the markers listed in Tables 1-5, or a subset thereof as described above, and can be used to 
monitor marker expression in the same way as a microarray containing only markers listed in 
Tables 1-5. 

[00166] In yet another specific embodiment, the microarray is a commercially-available 
cDNA microarray that comprises at least five of the markers listed in Tables 1-5. Preferably, 
a commercially-available cDNA microarray comprises all of the markers listed in Tables 1-5. 
However, such a microarray may comprise at least 5, 10, 15 or 25 of the markers in any of 
Tables 1-5, up to the maximum number of markers in a Table, and may comprise all of the 
markers in any one of Tables 1-5, and a subset of another of Tables 1-5, or subsets of each as 
described above. In a specific embodiment of the microarrays used in the methods disclosed 
herein, the markers that are all or a portion of Tables 1-5 make up at least 50%, 60%, 70%, 
80%, 90%, 95% or 98% of the probes on the microarray. 
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[00167] General methods pertaining to the construction of microarrays comprising the 

marker sets and/or subsets above are described in the following sections. 

[00168] 

[00169] 

5.5.2. 1 CONSTRUCTION OF MICROARRAYS 
[00170] Microarrays are prepared by selecting probes which comprise a polynucleotide 
sequence, and then immobilizing such probes to a solid support or surface. For example, the 
probes may comprise DNA sequences, RNA sequences, or copolymer sequences of DNA and 
RNA. The polynucleotide sequences of the probes may also comprise DNA and/or RNA 
analogues, or combinations thereof. For example, the polynucleotide sequences of the probes 
may be full or partial fragments of genomic DNA. The polynucleotide sequences of the 
probes may also be synthesized nucleotide sequences, such as synthetic oligonucleotide 
sequences. The probe sequences can be synthesized either enzymatically in vivo, 
enzymatically in vitro {e.g., by PCR), or non-enzymatically in vitro. 

[00171] The probe or probes used in the methods of the invention are preferably immobilized 
to a solid support which may be either porous or non-porous. For example, the probes of the 
invention may be polynucleotide sequences which are attached to a nitrocellulose or nylon 
membrane or filter covalently at either the 3' or the 5' end of the polynucleotide. Such 
hybridization probes are well known in the art (see, e.g., Sambrook et al, Molecular 
Cloning - A Laboratory Manual (2nd Ed.), Vols. 1-3, Cold Spring Harbor Laboratory, 
Cold Spring Harbor, New York (1989). Alternatively, the solid support or surface may be a 
glass or plastic surface. In a particularly preferred embodiment, hybridization levels are 
measured to microarrays of probes consisting of a solid phase on the surface of which are 
immobilized a population of polynucleotides, such as a population of DNA or DNA mimics, 
or, alternatively, a population of RNA or RNA mimics. The solid phase may be a nonporous 
or, optionally, a porous material such as a gel. 

[00172] In preferred embodiments, a microarray comprises a support or surface with an 
ordered array of binding {e.g., hybridization) sites or "probes" each representing one of the 
markers described herein. Preferably the microarrays are addressable arrays, and more 
preferably positionally addressable arrays. More specifically, each probe of the array is 
preferably located at a known, predetermined position on the solid support such that the 
identity {i.e., the sequence) of each probe can be determined from its position in the array 
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(i.e., on the support or surface). In preferred embodiments, each probe is covalently attached 
to the solid support at a single site. 

[00173] Microarrays can be made in a number of ways, of which several are described 
below. However produced, microarrays share certain characteristics. The arrays are 
reproducible, allowing multiple copies of a given array to be produced and easily compared 
with each other. Preferably, microarrays are made from materials that are stable under 
binding (e.g., nucleic acid hybridization) conditions. The microarrays are preferably small, 
e.g., between 1 cm 2 and 25 cm 2 , between 12 cm 2 and 13 cm 2 , or 3 cm 2 . However, larger 
arrays are also contemplated and may be preferable, e.g., for use in screening arrays. 
Preferably, a given binding site or unique set of binding sites in the microarray will 
specifically bind (e.g., hybridize) to the product of a single gene in a cell (e.g., to a specific 
mRNA, or to a specific cDNA derived therefrom). However, in general, other related or 
similar sequences will cross hybridize to a given binding site. 

[00174] The microarrays of the present invention include one or more test probes, each of 
which has a polynucleotide sequence that is complementary to a subsequence of RNA or 
DNA to be detected. Preferably, the position of each probe on the solid surface is know. 
Indeed, the microarrays are preferably positionally addressable arrays. Specifically, each 
probe of the array is preferably located at a known, predetermined position on the solid 
support such that the identity (i.e., the sequence) of each probe can be determined from its 
position on the array (i.e., on the support or surface). 

[00175] According to the invention, the microarray is an array (i.e., a matrix) in which each 
position represents one of the markers described herein. For example, each position can 
contain a DNA or DNA analogue based on genomic DNA to which a particular RNA or 
cDNA transcribed from that genetic marker can specifically hybridize. The DNA or DNA 
analogue can be, e.g., a synthetic oligomer or a gene fragment. In one embodiment, probes 
representing each of the markers is present on the array. In a preferred embodiment, the array 
comprises probes for each of the markers listed in Tables 1-5. 

5.5.2.2 PREPARING PROBES FOR MICROARRAYS 
[00176] As noted above, the "probe" to which a particular polynucleotide molecule 
specifically hybridizes according to the invention contains a complementary genomic 
polynucleotide sequence. The probes of the microarray preferably consist of nucleotide 
sequences of no more than 1,000 nucleotides. In some embodiments, the probes of the array 
consist of nucleotide sequences of 10 to 1,000 nucleotides. In a preferred embodiment, the 
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nucleotide sequences of the probes are in the range of 10-200 nucleotides in length and are 
genomic sequences of a species of organism, such that a plurality of different probes is 
present, with sequences complementary and thus capable of hybridizing to the genome of 
such a species of organism, sequentially tiled across all or a portion of such genome. In other 
specific embodiments, the probes are in the range of 10-30 nucleotides in length, in the range 
of 10-40 nucleotides in length, in the range of 20-50 nucleotides in length, in the range of 40- 
80 nucleotides in length, in the range of 50-150 nucleotides in length, in the range of 80-120 
nucleotides in length, and most preferably are 60 nucleotides in length. 
[00177] The probes may comprise DNA or DNA "mimics" (e.g., derivatives and analogues) 
corresponding to a portion of an organism's genome. In another embodiment, the probes of 
the microarray are complementary RNA or RNA mimics. DNA mimics are polymers 
composed of subunits capable of specific, Watson-Crick-like hybridization with DNA, or of 
specific hybridization with RNA. The nucleic acids can be modified at the base moiety, at 
the sugar moiety, or at the phosphate backbone. Exemplary DNA mimics include, e.g., 
phosphorothioates . 

[001 78] DNA can be obtained, e.g., by polymerase chain reaction (PCR) amplification of 
genomic DNA or cloned sequences. PCR primers are preferably chosen based on a known 
sequence of the genome that will result in amplification of specific fragments of genomic 
DNA. Computer programs that are well known in the art are useful in the design of primers 
with the required specificity and optimal amplification properties, such as Oligo version 5.0 
(National Biosciences). Typically each probe on the microarray will be between 10 bases 
and 50,000 bases, usually between 300 bases and 1,000 bases in length. PCR methods are 
well known in the art, and are described, for example, in Innis et al, eds., PCR Protocols: 
A Guide to Methods and Applications, Academic Press Inc., San Diego, CA (1990). It 
will be apparent to one skilled in the art that controlled robotic systems are useful for 
isolating and amplifying nucleic acids. 

[00179] An alternative, preferred means for generating the polynucleotide probes of the 
microarray is by synthesis of synthetic polynucleotides or oligonucleotides, e.g., using N- 
phosphonate or phosphoramidite chemistries (Froehler et al, Nucleic Acid Res. 14:5399-5407 
(1986); McBride et al, Tetrahedron Lett. 24:246-248 (1983)). Synthetic sequences are 
typically between about 10 and about 500 bases in length, more typically between about 20 
and about 100 bases, and most preferably between about 40 and about 70 bases in length. In 
some embodiments, synthetic nucleic acids include non-natural bases, such as, but by no 
means limited to, inosine. As noted above, nucleic acid analogues may be used as binding 
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sites for hybridization. An example of a suitable nucleic acid analogue is peptide nucleic acid 
(see, e.g., Egholm etal, Nature 363:566-568 (1993); U.S. Patent No. 5,539,083). 
[00180] Probes are preferably selected using an algorithm that takes into account binding 
energies, base composition, sequence complexity, cross-hybridization binding energies, and 
secondary structure. See Friend et al, International Patent Publication WO 01/05935, 
published January 25, 2001; Hughes et ah, Nat. Biotech. 19:342-7 (2001). 
[00181] A skilled artisan will also appreciate that positive control probes, e.g., probes known 
to be complementary and hybridizable to sequences in the target polynucleotide molecules, 
and negative control probes, e.g., probes known to not be complementary and hybridizable to 
sequences in the target polynucleotide molecules, should be included on the array. In one 
embodiment, positive controls are synthesized along the perimeter of the array. In another 
embodiment, positive controls are synthesized in diagonal stripes across the array. In still 
another embodiment, the reverse complement for each probe is synthesized next to the 
position of the probe to serve as a negative control. In yet another embodiment, sequences 
from other species of organism are used as negative controls or as "spike-in" controls. 

5.5.2.3 ATTACHING PROBES TO THE SOLID SURFACE 
[00182] The probes are attached to a solid support or surface, which may be made, e.g., from 
glass, plastic (e.g., polypropylene, nylon), polyacrylamide, nitrocellulose, gel, or other porous 
or nonporous material. A preferred method for attaching the nucleic acids to a surface is by 
printing on glass plates, as is described generally by Schena et al, Science 270:467-470 

(1995) . This method is especially useful for preparing microarrays of cDNA (See also, 
DeRisi et al, Nature Genetics 14:457-460 (1996); Shalon et al, Genome Res. 6 :639-645 

(1996) ; and Schena et al, Proc. Natl Acad. Sci. U.S.A. 93:10539-1 1286 (1995)). 
[00183] A second preferred method for making microarrays is by making high-density 
oligonucleotide arrays. Techniques are known for producing arrays containing thousands of 
oligonucleotides complementary to defined sequences, at defined locations on a surface using 
photolithographic techniques for synthesis in situ (see, Fodor et al, 1991, Science 251:767- 
773; Pease et al, 1994, Proc. Natl Acad. Sci. U.S.A. 91:5022-5026; Lockhart etal, 1996, 
Nature Biotechnology 14:1675; U.S. Patent Nos. 5,578,832; 5,556,752; and 5,510,270) or 
other methods for rapid synthesis and deposition of defined oligonucleotides (Blanchard et 
al, Biosensors & Bioelectronics 1 1 :687-690). When these methods are used, 
oligonucleotides (e.g., 60-mers) of known sequence are synthesized directly on a surface such 
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as a derivatized glass slide. Usually, the array produced is redundant, with several 
oligonucleotide molecules per RNA. 

[00184] Other methods for making microarrays, e.g., by masking (Maskos and Southern, 
1992, Nuc. Acids. Res. 20:1679-1684), may also be used. In principle, and as noted supra, 
any type of array, for example, dot blots on a nylon hybridization membrane (see Sambrook 
et ah, Molecular Cloning - A Laboratory Manual (2nd Ed.), Vols. 1-3, Cold Spring 
Harbor Laboratory, Cold Spring Harbor, New York (1989)) could be used. However, as will 
be recognized by those skilled in the art, very small arrays will frequently be preferred 
because hybridization volumes will be smaller. 

[001 85] In one embodiment, the arrays of the present invention are prepared by synthesizing 
polynucleotide probes on a support. In such an embodiment, polynucleotide probes are 
attached to the support covalently at either the 3' or the 5' end of the polynucleotide. 
[00 1 86] In a particularly preferred embodiment, microarrays of the invention are 
manufactured by means of an ink jet printing device for oligonucleotide synthesis, e.g., using 
the methods and systems described by Blanchard in U.S. Pat. No. 6,028,189; Blanchard et 
al., 1996, Biosensors and Bioelectronics 1 1 :687-690; Blanchard, 1998, in Synthetic DNA 
Arrays in Genetic Engineering, Vol. 20, J.K. Setlow, Ed., Plenum Press, New York at pages 
1 1 1-123. Specifically, the oligonucleotide probes in such microarrays are preferably 
synthesized in arrays, e.g., on a glass slide, by serially depositing individual nucleotide bases 
in "microdroplets" of a high surface tension solvent such as propylene carbonate. The 
microdroplets have small volumes (e.g., 100 pL or less, more preferably 50 pL or less) and 
are separated from each other on the microarray (e.g., by hydrophobic domains) to form 
circular surface tension wells which define the locations of the array elements (i.e., the 
different probes). Microarrays manufactured by this ink-jet method are typically of high 
density, preferably having a density of at least about 2,500 different probes per 1 cm 2 . The 
polynucleotide probes are attached to the support covalently at either the 3' or the 5' end of 
the polynucleotide. 

5.5.2.4 TARGET POLYNUCLEOTIDE MOLECULES 
[00187] The polynucleotide molecules which may be analyzed by the present invention (the 
"target polynucleotide molecules") may be from any clinically relevant source, but are 
expressed RNA or a nucleic acid derived therefrom (e.g., cDNA or amplified RNA derived 
from cDNA that incorporates an RNA polymerase promoter), including naturally occurring 
nucleic acid molecules, as well as synthetic nucleic acid molecules. In one embodiment, the 
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target polynucleotide molecules comprise RNA, including, but by no means limited to, total 
cellular RNA, poly(A) + messenger RNA (mRNA) or fraction thereof, cytoplasmic mRNA, or 
RNA transcribed from cDNA {i.e., cRNA; see, e.g., Linsley & Schelter, U.S. Patent 
Application No. 09/411,074, filed October 4, 1999, or U.S. Patent Nos. 5,545,522, 5,891,636, 
or 5,716,785). Methods for preparing total and poly(A) + RNA are well known in the art, and 
are described generally, e.g., in Sambrook et at, Molecular Cloning - A Laboratory 
Manual (2nd Ed.), Vols. 1-3, Cold Spring Harbor Laboratory, Cold Spring Harbor, New 
York (1989). In one embodiment, RNA is extracted from cells of the various types of interest 
in this invention using guanidinium thiocyanate lysis followed by CsCl centrifugation 
(Chirgwin et al, 1979, Biochemistry 18:5294-5299). In another embodiment, total RNA is 
extracted using a silica gel-based column, commercially available examples of which include 
RNeasy (Qiagen, Valencia, California) and StrataPrep (Stratagene, La Jolla, California). In 
an alternative embodiment, which is preferred for S. cerevisiae, RNA is extracted from cells 
using phenol and chloroform, as described in Ausubel et al, eds., 1989, Current 
Protocols in Molecular Biology, Vol. Ill, Green Publishing Associates, Inc., John Wiley 
& Sons, Inc., New York, at pp. 13.12.1-13.12.5). Poly(A) + RNA can be selected, e.g., by 
selection with oligo-dT cellulose or, alternatively, by oligo-dT primed reverse transcription of 
total cellular RNA. In one embodiment, RNA can be fragmented by methods known in the 
art, e.g., by incubation with ZnCl 2 , to generate fragments of RNA. In another embodiment, 
the polynucleotide molecules analyzed by the invention comprise cDNA, or PGR products of 
amplified RNA or cDNA. 

[00188] In one embodiment, total RNA, mRNA, or nucleic acids derived therefrom, is 
isolated from a sample taken from a person afflicted with breast cancer. Target 
polynucleotide molecules that are poorly expressed in particular cells may be enriched using 
normalization techniques (Bonaldo etal, 1996, Genome Res. 6:791-806). 
[00189] As described above, the target polynucleotides are detectably labeled at one or more 
nucleotides. Any method known in the art may be used to detectably label the target 
polynucleotides. Preferably, this labeling incorporates the label uniformly along the length of 
the RNA, and more preferably, the labeling is carried out at a high degree of efficiency. One 
embodiment for this labeling uses oligo-dT primed reverse transcription to incorporate the 
label; however, conventional methods of this method are biased toward generating 3' end 
fragments. Thus, in a preferred embodiment, random primers (e.g., 9-mers) are used in 
reverse transcription to uniformly incorporate labeled nucleotides over the full length of the 
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target polynucleotides. Alternatively, random primers may be used in conjunction with PCR 
methods or T7 promoter-based in vitro transcription methods in order to amplify the target 
polynucleotides. 

[00190] In a preferred embodiment, the detectable label is a luminescent label. For example, 
fluorescent labels, bioluminescent labels, chemiluminescent labels, and colorimetric labels 
may be used in the present invention. In a highly preferred embodiment, the label is a 
fluorescent label, such as a fluorescein, a phosphor, a rhodamine, or a polymethine dye 
derivative. Examples of commercially available fluorescent labels include, for example, 
fluorescent phosphoramidites such as FluorePrime (Amersham Pharmacia, Piscataway, N.J.), 
Fluoredite (Millipore, Bedford, Mass.), FAM (ABI, Foster City, Calif), and Cy3 or Cy5 
(Amersham Pharmacia, Piscataway, N.J.). In another embodiment, the detectable label is a 
radiolabeled nucleotide. 

[00191] In a further preferred embodiment, target polynucleotide molecules from a patient 
sample are labeled differentially from target polynucleotide molecules of a standard. The 
standard can comprise target polynucleotide molecules from normal individuals {i.e., those 
not afflicted with breast cancer). In a highly preferred embodiment, the standard comprises 
target polynucleotide molecules pooled from samples from normal individuals or tumor 
samples from individuals having sporadic-type breast tumors. In another embodiment, the 
target polynucleotide molecules are derived from the same individual, but are taken at 
different time points, and thus indicate the efficacy of a treatment by a change in expression 
of the markers, or lack thereof, during and after the course of treatment (i.e., chemotherapy, 
radiation therapy or cryotherapy), wherein a change in the expression of the markers from a 
poor prognosis pattern to a good prognosis pattern indicates that the treatment is efficacious. 
In this embodiment, different timepoints are differentially labeled. 

5.5.2.5 HYBRIDIZATION TO MICROARRAYS 
[00192] Nucleic acid hybridization and wash conditions are chosen so that the target 
polynucleotide molecules specifically bind or specifically hybridize to the complementary 
polynucleotide sequences of the array, preferably to a specific array site, wherein its 
complementary DNA is located. 

[00193] Arrays containing double-stranded probe DNA situated thereon are preferably 
subjected to denaturing conditions to render the DNA single-stranded prior to contacting with 
the target polynucleotide molecules. Arrays containing single-stranded probe DNA (e.g., 
synthetic oligodeoxyribonucleic acids) may need to be denatured prior to contacting with the 
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target polynucleotide molecules, e.g., to remove hairpins or dimers which form due to self 
complementary sequences. 

[00194] Optimal hybridization conditions will depend on the length (e.g., oligomer versus 
polynucleotide greater than 200 bases) and type (e.g., RNA, or DNA) of probe and target 
nucleic acids. One of skill in the art will appreciate that as the oligonucleotides become 
shorter, it may become necessary to adjust their length to achieve a relatively uniform 
melting temperature for satisfactory hybridization results. General parameters for specific 
(i.e., stringent) hybridization conditions for nucleic acids are described in Sambrook et al, 
Molecular Cloning - A Laboratory Manual (2nd Ed.), Vols. 1-3, Cold Spring Harbor 
Laboratory, Cold Spring Harbor, New York (1989), and in Ausubel et al, Current 
Protocols in Molecular Biology, vol. 2, Current Protocols Publishing, New York 
(1994). Typical hybridization conditions for the cDNA microarrays of Schena et al. are 
hybridization in 5 X SSC plus 0.2% SDS at 65°C for four hours, followed by washes at 25°C 
in low stringency wash buffer (1 X SSC plus 0.2% SDS), followed by 10 minutes at 25°C in 
higher stringency wash buffer (0.1 X SSC plus 0.2% SDS) (Schena et al, Proc. Natl Acad. 
Set U.S.A. 93:10614 (1993)). Useful hybridization conditions are also provided in, e.g., 
Tijessen, 1993, Hybridization With Nucleic Acid Probes, Elsevier Science Publishers 
B.V.; and Kricka, 1992, Nonisotopic DNA Probe Techniques , Academic Press, San 
Diego, CA. 

[00195] Particularly preferred hybridization conditions include hybridization at a temperature 
at or near the mean melting temperature of the probes (e.g., within 51°C, more preferably 
within 21°C) in 1 M NaCl, 50 mM MES buffer (pH 6.5), 0.5% sodium sarcosine and 30% 
forrnamide. 

5.5.2.6 SIGNAL DETECTION AND DATA ANALYSIS 
[00196] When fluorescently labeled probes are used, the fluorescence emissions at each site 
of a microarray may be, preferably, detected by scanning confocal laser microscopy. In one 
embodiment, a separate scan, using the appropriate excitation line, is carried out for each of 
the two fluorophores used. Alternatively, a laser may be used that allows simultaneous 
specimen illumination at wavelengths specific to the two fluorophores and emissions from 
the two fluorophores can be analyzed simultaneously (see Shalon et al, 1996, "A DNA 
microarray system for analyzing complex DNA samples using two-color fluorescent probe 
hybridization," Genome Research 6:639-645, which is incorporated by reference in its 
entirety for all purposes). In a preferred embodiment, the arrays are scanned with a laser 
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fluorescent scanner with a computer controlled X-Y stage and a microscope objective. 
Sequential excitation of the two fluorophores is achieved with a multi-line, mixed gas laser 
and the emitted light is split by wavelength and detected with two photomultiplier tubes. 
Fluorescence laser scanning devices are described in Schena et al, Genome Res, 6:639-645 
(1996), and in other references cited herein. Alternatively, the fiber-optic bundle described 
by Ferguson et al. 9 Nature Biotech 14:1681-1684 (1996), may be used to monitor mRNA 
abundance levels at a large number of sites simultaneously. 

[00197] Signals are recorded and, in a preferred embodiment, analyzed by computer, e.g., 
using a 12 or 16 bit analog to digital board. In one embodiment the scanned image is 
despeckled using a graphics program {e.g., Hijaak Graphics Suite) and then analyzed using an 
image gridding program that creates a spreadsheet of the average hybridization at each 
wavelength at each site. If necessary, an experimentally determined correction for "cross 
talk" (or overlap) between the channels for the two fluors may be made. For any particular 
hybridization site on the transcript array, a ratio of the emission of the two fluorophores can 
be calculated. The ratio is independent of the absolute expression level of the cognate gene, 
but is useful for genes whose expression is significantly modulated in association with the 
different breast cancer-related condition. 

5.6 THERAPEUTIC REGIMENS SPECIFIC TO PATIENT SUBSETS 
[00198] The benefit of identifying subsets of individuals that have a common condition, 
followed by identification of sets of genes informative for those particular subsets of 
individuals, is that such subdivision and identification tends to more accurately identify the 
subset of genes responsible for, or most closely associated with, a particular form of the 
condition. For example, breast cancer is a complex condition brought about by several 
different molecular mechanisms. ER+ individuals, particularly ER+, ER/AGE high 
individuals, show an increased level of expression of cell cycle-control genes, and the 
expression of these genes is highly informative for prognosis in this patient subset (see 
Examples). In ER~~ individuals, however, the expression of these genes is not informative for 
prognosis. 

[00199] The set of informative markers, therefore, can be used to assign a particular course 
of therapy to an individual, e.g., an individual having breast cancer, depending upon the 
condition subset into which the individual is classified. In one embodiment, therefore, the 
invention provides a method of assigning a course of therapy to an individual having a 
condition, said method comprising classifying the individual into one of a plurality of subsets 
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of a condition, wherein a plurality of informative genes has been identified for at least one of 
said subsets; and assigning a course of therapy known or suspected to be effective for treating 
the subset of the condition associated with those genes. In a specific embodiment, said 
condition is breast cancer, said patient subset is ER+, ER/AGE high status, and said course of 
therapy comprises the administration of one or more compounds known or suspected to be 
effective at arresting the cell cycle. In a more specific embodiment, said one or more 
compounds comprises taxol or a vinca alkaloid. 

[00200] Of course, any course of therapy selected or assigned on the basis of the above 
phenotypes and gene expression may be supplemented by other treatments or courses of 
therapy relevant to or known or suspected to be effective in the treatment of the condition. 
For example, the treatment of breast cancer may additionally comprise surgery, either tissue- 
preserving or radical, radiation treatment, chemotherapy other than that suggested by gene 
expression analysis, or any other therapy or treatment known or suspected to be effective. 

5 .7 CLINICAL TRIALS AND EPIDEMIOLOGICAL STUDIES 
[00201] The method of the present invention may also be used to assign individuals to 
categories within a clinical trial, epidemiological study or the like. For example, individuals 
may be distinguished according to a characteristic of a condition, such as the presence or 
absence of specific proteins (e.g., estrogen receptor) or tissue structures (e.g., lymph nodes), 
and with prognosis, and the results of the trial correlated with prognosis. In a specific 
example, the condition is breast cancer, the characteristic is the presence of the estrogen 
receptor, and the outcome is prognosis is the expected reoccurrence or non-reoccurrence of 
metastases within a given period, for example, five years, after initial diagnosis. In another 
specific example, the condition is obesity, the characteristics are 24-hour energy expenditure, 
and the prognosis is the expected occurrence of heart disease or diabetes. In another specific 
example, the condition is a neurodegenerative disease, the characteristic is exposure to a 
particular range of concentration of an environmental toxin, and the prognosis is expected 
occurrence or degree of loss of motor function. In each case, the characteristics and expected 
outcome are used to assign the individual to a category within a clinical trial or 
epidemiological study. 

[00202] Thus, the invention provides a method for assigning an individual to one of a 
plurality of categories in a clinical trial, comprising classifying the individual into one of a 
plurality of condition categories differentiated by at least one genotypic or phenotypic 
characteristic of the condition; determining the level of expression, in a sample derived from 
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said individual, of a plurality of genes informative for said condition category; determining 
whether said level of expression of said plurality of genes indicates that the individual has a 
good prognosis or a poor prognosis; and assigning the individual to a category in a clinical 
trial on the basis of prognosis. 

[00203] In a specific embodiment, the invention provides a method of assigning an 
individual to a category in a breast cancer clinical trial, said method comprising: (a) 
classifying said individual as ER~, BRCA1, ERT, sporadic; ER+, ER/AGE high; ER+, 
ER/AGE low, LN+; or ER+, ER/AGE low, LN~; (b) determining for said individual the level 
of expression of at least two genes for which markers are listed in Table 1 if said individual 
is classified as ER", BRCA1; Table 2 if said individual is classified as ER", sporadic; Table 3 
if said individual is classified as ER+, ER/AGE high; Table 4 if said individual is classified 
as ER+, ER/AGE low, LN+; or Table 5 if said individual is classified as ER+, ER/AGE low, 
LIST; (c) determining whether said individual has a pattern of expression of said at least two 
genes that correlates with a good prognosis or a poor prognosis; and (d) assigning said 
individual to at least one category in a clinical trial if said individual has a good prognosis, 
and assigning said individual to a second category in said clinical trial if said individual has a 
poor prognosis. In a more specific embodiment, said individual is additionally assigned to a 
category in said clinical trial on the basis of the classification of said individual as determined 
in step (a). In another more specific embodiment, said individual is additionally assigned to a 
category in said clinical trial on the basis of any other clinical, phenotypic or genotypic 
characteristic of breast cancer. In another more specific embodiment, the method 
additionally comprises determining in said cell sample the level of expression, relative to a 
control, of a second plurality of genes for which markers are not found in Tables 1-5, wherein 
said second plurality of genes is informative for prognosis of breast cancer, and determining 
from the expression of said second plurality of genes, in addition to said first plurality of 
genes, whether said individual has a good prognosis or a poor prognosis. 

5.8 KITS 

[00204] The present invention further provides for kits comprising the marker sets described 
above. The components of the kits of the present invention are preferably contained in sealed 
containers. In a preferred embodiment, the kit comprises a microarray ready for 
hybridization to target polynucleotide molecules. In specific embodiments, the kit may 
comprise any of the microarrays described in detail in Section 5.5.2. Where proteins are the 
target molecules, the kit preferably comprises a plurality of antibodies for binding to specific 
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condition-related proteins, and means for identifying such binding (e.g., means for 
performing a sandwich assay, ELISA, RIA, etc.). Such antibodies may be provided, for 
example, individually or as part of an antibody array. The kit may additionally comprise 
software for the data analyses described above, as described in detail in Section 5.9. The kit 
preferably contains one or more control samples. Such a control sample may be an artificial 
population of marker-related or marker-derived polynucleotides suitable for hybridization to 
a microarray, wherein the markers are related to or relevant to the condition of interest (for 
example, breast cancer). The control may also, or alternatively, be a set of expression values 
stored on a computer disk or other storage medium. 

[00205] The kits of the invention may be primarily diagnostic in nature; that is, they may 
assist a physician or researcher in determining a characteristic, for example, the prognosis, of 
a condition of interest, the likely response to a therapeutic regimen, the likely outcome of 
exposure to an environmental condition, such as toxin exposure, etc. The kits of the 
invention may also be used to classify individuals, for example, to place individuals into 
different groups in a clinical trial. The use of each kit is determined by the markers, 
microarrays, controls, etc. included. 

[00206] COMPUTER-FACILITATED ANALYSISThe analytic methods described in the 
previous sections can be implemented by use of the following computer systems and 
according to the following programs and methods. A computer system comprises internal 
components linked to external components. The internal components of a typical computer 
system include a processor element interconnected with a main memory. For example, the 
computer system can be based on an Intel 8086-, 803 86-, 80486-, Pentium™, or Pentium™- 
based processor with preferably 32 MB or more of main memory. The computer system may 
also be a Macintosh or a Macintosh-based system, but may also be a minicomputer or 
mainframe. 

[00207] The external components preferably include mass storage. This mass storage can be 
one or more hard disks (which are typically packaged together with the processor and 
memory). Such hard disks are preferably of 1 GB or greater storage capacity. Other external 
components include a user interface device, which can be a monitor, together with an 
inputting device, which can be a "mouse", or other graphic input devices, and/or a keyboard. 
A printing device can also be attached to the computer. 

[00208] Typically, a computer system is also linked to network link, which can be part of an 
Ethernet link to other local computer systems, remote computer systems, or wide area 
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communication networks, such as the Internet. This network link allows the computer 
system to share data and processing tasks with other computer systems. 
[00209] Loaded into memory during operation of this system are several software 
components, which are both standard in the art and special to the instant invention. These 
software components collectively cause the computer system to function according to the 
methods of this invention. These software components are typically stored on the mass 
storage device. A software component comprises the operating system, which is responsible 
for managing computer system and its network interconnections. This operating system can 
be, for example, of the Microsoft Windows® family, such as Windows 3.1, Windows 95, 
Windows 98, Windows 2000, or Windows NT, or may be of the Macintosh OS family, or 
may be UNIX, a UNIX derivative such as LINUX ? or an operating system specific to a 
minicomputer or mainframe. The software component represents common languages and 
functions conveniently present on this system to assist programs implementing the methods 
specific to this invention. Many high or low level computer languages can be used to 
program the analytic methods of this invention. Instructions can be interpreted during run- 
time or compiled. Preferred languages include C/C++, FORTRAN and JAVA. Most 
preferably, the methods of this invention are programmed in mathematical software packages 
that allow symbolic entry of equations and high-level specification of processing, including 
some or all of the algorithms to be used, thereby freeing a user of the need to procedurally 
program individual equations or algorithms. Such packages include Mathlab from 
Mathworks (Natick, MA), Mathematica® from Wolfram Research (Champaign, IL), or S- 
Plus® from Math Soft (Cambridge, MA). Specifically, the software component includes the 
analytic methods of the invention as programmed in a procedural language or symbolic 
package. 

[00210] The software to be included with the kit comprises the data analysis methods of the 
invention as disclosed herein. In particular, the software may include mathematical routines 
for marker discovery, including the calculation of similarity values between clinical 
categories (e.g., prognosis) and marker expression. The software may also include 
mathematical routines for calculating the similarity between sample marker expression and 
control marker expression, using array-generated fluorescence data, to determine the clinical 
classification of a sample. 

[0021 1] Additionally, the software may also include mathematical routines for determining 
the prognostic outcome, and recommended therapeutic regimen, for an individual with a 
condition of interest. In the specific example of breast cancer, the mathematical routines 
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would determine the prognostic outcome and recommended therapeutic regimen for an 
individual having breast cancer. Such breast cancer-specific software would include 
instructions for the computer system's processor to receive data structures that include the 
level of expression of five or more of the marker genes listed in any of Tables 1-5 in a breast 
cancer tumor sample obtained from the breast cancer patient; the mean level of expression of 
the same genes in a control or template; and the breast cancer patient's clinical information, 
including age, lymph node status and ER status. The software may additionally include 
mathematical routines for transforming the hybridization data and for calculating the 
similarity between the expression levels for the marker genes in the patient's breast cancer 
tumor sample and a control or template. In a specific embodiment, the software includes 
mathematical routines for calculating a similarity metric, such as a coefficient of correlation, 
representing the similarity between the expression levels for the marker genes in the patient's 
breast cancer tumor sample and the control or template, and expressing the similarity as that 
similarity metric. 

[00212] The software preferably would include decisional routines that integrate the patient's 
clinical and marker gene expression data, and recommend a course of therapy. In one 
embodiment, for example, the software causes the processor unit to receive expression data 
for prognosis-related genes in the patient's tumor sample, calculate a metric of similarity of 
these expression values to the values for the same genes in a template or control, compare 
this similarity metric to a pre-selected similarity metric threshold or thresholds that 
differentiate prognostic groups, assign the patient to the prognostic group, and, on the basis of 
the prognostic group, assign a recommended therapeutic regimen. In a specific example, the 
software additionally causes the processor unit to receive data structures comprising clinical 
information about the breast cancer patient. In a more specific example, such clinical 
information includes the patient's age, estrogen receptor status, and lymph node status. 
[00213] The software preferably causes the processor unit to receive data structures 
comprising relevant phenotypic and/or genotypic characteristics of the particular condition of 
interest, and/or of an individual having that condition, and classifies the individual into a 
condition subset according to those characteristics. The software then causes the processor to 
receive values for subset-specific markers, to calculate a metric of similarity of the values 
associated with those markers (e.g., level, abundance, activity, etc.) from the individual to a 
control, compare this similarity metric to a pre-selected similarity metric threshold or 
thresholds that differentiate prognostic groups, assign the patient to a prognostic group, and, 
on the basis of the prognostic group, assign a recommended therapeutic regimen. In the 
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specific example of breast cancer and a breast cancer patient, the software, in one 
embodiment, causes the processor unit to receive data structures comprising the patient's age, 
estrogen receptor status, and lymph node status, and on the basis of this data, to classify the 
patient into one of the following patient subsets: ER~ sporadic; ER~, BRCA1; ER+, 
AR/AGE high; ER+, ER/AGE low, LN+; or ER+, ER/AGE low, LN~ The software then 
causes the processor to receive expression values for subset-specific prognosis-informative 
gene expression in the patient's tumor sample, calculate a metric of similarity of these 
expression values to the values for the same genes in a patient subset-specific template or 
control, compare this similarity metric to a pre-selected similarity metric threshold or 
thresholds that differentiate prognostic groups, assign the patient to the prognostic group, 
and, on the basis of the prognostic group, assign a recommended therapeutic regimen. 
[00214] Where the control is an expression template comprising expression values for 
marker genes within a group of patients, e.g., breast cancer patients, the control can comprise 
either hybridization data obtained at the same time (i.e., in the same hybridization 
experiment) as the patient's individual hybridization data, or can be a set of hybridization or 
marker expression values stores on a computer, or on computer-readable media. If the latter 
is used, new patient hybridization data for the selected marker genes, obtained from initial or 
follow-up tumor samples, or suspected tumor samples, can be compared to the stored values 
for the same genes without the need for additional control hybridizations. However, the 
software may additionally comprise routines for updating the control data set, e.g., to add 
information from additional breast cancer patients or to remove existing members of the 
control data set, and, consequently, for recalculating the average expression level values that 
comprise the template. In another specific embodiment, said control comprises a set of 
single-channel mean hybridization intensity values for each of said at least five of said genes, 
stored on a computer-readable medium. 

[00215] Clinical data relating to a breast cancer patient, or a patient having another type of 
condition, and used by the computer program products of the invention, can be contained in a 
database of clinical data in which information on each patient is maintained in a separate 
record, which record may contain any information relevant to the patient, the patient's 
medical history, treatment, prognosis, or participation in a clinical trial or study, including 
expression profile data generated as part of an initial diagnosis or for tracking the progress of 
the condition, for example, breast cancer, during treatment. 

[00216] Thus, one embodiment of the invention provides a computer program product for 
classifying a breast cancer patient according to prognosis, the computer program product for 
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use in conjunction with a computer having a memory and a processor, the computer program 
product comprising a computer readable storage medium having a computer program 
mechanism encoded thereon ? wherein said computer program product can be loaded into the 
one or more memory units of a computer and causes the one or more processor units of the 
computer to execute the steps of (a) receiving a first data structure comprising said breast 
cancer patient's age, ER status, LN status and tumor type; (b) classifying said patient as ER , 
sporadic; ER~ BRCA1; ER+, ER/AGE high; ER+, ER/AGE low, LN+; or ER+, ER/AGE 
low, LN""; (c) receiving a first data structure comprising the level of expression of at least two 
genes in a cell sample taken from said breast cancer patient wherein markers for said at least 
two genes are listed in Table 1 if said patient is classified as ER~, sporadic; Table 2 if said 
patient is classified as ER", sporadic; Table 3 if said patient is classified as ER+, ER/AGE 
high; Table 4 if said patient is classified as ER+, ER/AGE low, LN+; or Table 5 if said 
patient is classified as ER+, ER/AGE high, LN~; (d) determining the similarity of the level of 
expression of said at least two genes to control levels of expression of said at least two genes 
to obtain a patient similarity value; (e) comparing said patient similarity value to selected first 
and second threshold values of similarity of said level of expression of said genes to said 
control levels of expression to obtain first and second similarity threshold values, 
respectively, wherein said second similarity threshold indicates greater similarity to said 
control levels of expression than does said first similarity threshold; and (f) classifying said 
breast cancer patient as having a first prognosis if said patient similarity value exceeds said 
first and said second threshold similarity values, a second prognosis if said patient similarity 
value exceeds said first threshold similarity value but does not exceed said second threshold 
similarity value, and a third prognosis if said patient similarity value does not exceed said 
first threshold similarity value or said second threshold similarity value. In a specific 
embodiment of said computer program product, said first threshold value of similarity and 
said second threshold value of similarity are values stored in said computer. In another more 
specific embodiment, said first prognosis is a "very good prognosis," said second prognosis is 
an "intermediate prognosis," and said third prognosis is a "poor prognosis," and wherein said 
computer program mechanism may be loaded into the memory and further cause said one or 
more processor units of said computer to execute the step of assigning said breast cancer 
patient a therapeutic regimen comprising no adjuvant chemotherapy if the patient is lymph 
node negative and is classified as having a good prognosis or an intermediate prognosis, or 
comprising chemotherapy if said patient has any other combination of lymph node status and 
expression profile. In another specific embodiment, said computer program mechanism may 
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be loaded into the memory and further cause said one or more processor units of the 
computer to execute the steps of receiving a data structure comprising clinical data specific to 
said breast cancer patient. In a more specific embodiment, said single-channel hybridization 
intensity values are log transformed. The computer implementation of the method, however, 
may use any desired transformation method. In another specific embodiment, the computer 
program product causes said processing unit to perform said comparing step (e) by 
calculating the difference between the level of expression of each of said genes in said cell 
sample taken from said breast cancer patient and the level of expression of the same genes in 
said control. In another specific embodiment, the computer program product causes said 
processing unit to perform said comparing step (e) by calculating the mean log level of 
expression of each of said genes in said control to obtain a control mean log expression level 
for each gene, calculating the log expression level for each of said genes in a breast cancer 
sample from said breast cancer patient to obtain a patient log expression level, and 
calculating the difference between the patient log expression level and the control mean log 
expression for each of said genes. In another specific embodiment, the computer program 
product causes said processing unit to perform said comparing step (e) by calculating 
similarity between the level of expression of each of said genes in said cell sample taken from 
said breast cancer patient and the level of expression of the same genes in said control, 
wherein said similarity is expressed as a similarity value. In more specific embodiment, said 
similarity value is a correlation coefficient. The similarity value may, however, be expressed 
as any art-known similarity metric. 

[00217] Of course, the above breast cancer-specific examples are not limiting; analogous 
computer systems, software, and data analysis methods may be utilized for any condition of 
interest. For example, analogous software may be used to determine the prognosis of any 
other type of cancer, or of any other non-cancer diseases or conditions, using markers, 
expression level data and controls specific for that cancer, non-cancer disease or condition. 
[00218] In an exemplary implementation, to practice the methods of the present invention, a 
user first loads experimental data into the computer system. These data can be directly 
entered by the user from a monitor, keyboard, or from other computer systems linked by a 
network connection, or on removable storage media such as a CD-ROM, floppy disk (not 
illustrated), tape drive (not illustrated), ZIP® drive (not illustrated) or through the network. 
Next the user causes execution of expression profile analysis software which performs the 
methods of the present invention. 
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[00219] In another exemplary implementation, a user first loads experimental data and/or 
databases into the computer system. This data is loaded into the memory from the storage 
media or from a remote computer, preferably from a dynamic geneset database system, 
through the network. Next the user causes execution of software that performs the steps of 
the present invention. 

[00220] Additionally, because the data obtained and analyzed in the software and computer 
system products of the invention may be confidential, the software and/or computer system 
preferably comprises access controls or access control routines, such as password protection 
and preferably, particularly if information is to be transmitted between computers, for 
example, over the Internet, encryption of the data by a suitable encryption algorithm (e.g., 
PGP). 

[00221] Alternative computer systems and software for implementing the analytic methods 
of this invention will be apparent to one of skill in the art and are intended to be 
comprehended within the accompanying claims. In particular, the accompanying claims are 
intended to include the alternative program structures for implementing the methods of this 
invention that will be readily apparent to one of skill in the art. 

6. EXAMPLE: IDENTIFICATION OF PHENOTYPIC SUBSETS AND 

INFORMATIVE GENESETS FOR EACH 
[00222] Materials and Methods 
Tumor Samples : 

[00223] 311 cohort samples were collected from breast cancer patients. Selection criteria for 
sporadic patients (i.e., those not identified as having ai?i?C47-type tumor; n = 291) included: 
primary invasive breast carcinoma less than 5 cm (Tl or T2); no axillary metastases (NO); 
age at diagnosis of less than 55 years; calendar year of diagnosis 1983-1996; and no previous 
malignancies. All patients were treated by modified radical mastectomy or breast-conserving 
treatment. See van't Veer et al., Nature 415:530 (2002). Selection criteria for hereditary 
(i.e., BRCA1 -type; n = 20) tumors included: carriers of germline mutation in BRCA1 or 
BRCA2, and primary invasive breast carcinoma, van't Veer, supra. Additionally, for 
development of a classifier for the BRCA1 group, 14 BRCA1 samples previously identified 
(see van't Veer, supra) were added to the 20 BRCA1 type samples to increase sample size. 
Those 14 samples also satisfy the conditions that they are ER negative and age less than 55 
years old. 

[00224] Data analysis: 
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[00225] Sample sub-grouping: As shown in FIG. 1, tumor samples were first divided into 
ER+ and ER - branches since this is the dominant gene expression pattern. In the ER~ 
branch, the samples were further divided into "BRCA1 mutation like" and "Sporadic like" 
categories using the expression templates and 100 genes previously identified as optimal for 
determining BRCA1 status. See van't Veer et ah, Nature 415:530 (2002). In the ER+ 
category, samples were divided by ER vs. age distribution (see below) into two groups, 
"ER/AGE low" and "ER/AGE high." Within the "ER/AGE low" group, samples were 
further divided according to the lymph node status into two sub-groups: lymph node negative 
(0 lymph nodes; LN— ) and positive (> 0 lymph nodes; LN+) group. 

[00226] The result of these divisions was five distinctive sub-groups: "ER~~, sporadic" (n = 
52), "ER~, BRCA1" (n = 34), "ER+, ER/AGE high" (n = 83), "ER+, ER/AGE low, LN~" (n 
= 81), and "ER+, ER/AGE low, LN+" (n = 75). A few samples with a specific ER vs. age 
distribution in "ER+, ER/AGE low, LN+" group were further excluded to develop a 
classifier, see below for details. 

[00227] Estrogen receptor level : Estrogen receptor gene expression level was measured by a 
60mer oligo-nucleotide on a microarray. Since every individual sample was compared to a 
pool of all samples, the ratio to pool was used to measure the relative level. A threshold of 
—0.65 on logio(ratio) was used to separate the ER+ group from ER~~ group. See van't Veer et 
al, Nature 415:530 (2002). 

[00228] Grouping by ER vs. age distribution : Samples were not uniformly distributed in ER 
vs. age space among the ER+ samples (FIG. 2). First, it appeared that the ER level increases 
with age, as there were few samples from young individuals having a high ER expression 
level. For example, in the 35 to 40 years age group, samples having a log(ratio) of ER > 0.2 
are relatively few as compared to the 40 to 45 age group. In the set of samples used, the 40 < 
age < 45 group contains 30 samples having log(ratio) ER values between -0.2 to 0.2, and 28 
samples having values greater than 0.2, whereas the 35 < age < 40 group includes 24 samples 
with values between -0.2 to 0.2, but only 6 samples with values of greater than 0.2 (Fisher's 
exact test P-value: 1%). The increasing ER level with age may simply due to the fact that 
estrogen levels decrease with age, and the estrogen receptor level rises in compensation. 
[00229] There also appeared to be at least two groups of patients, as indicated by the solid 
line separating the two in FIG. 2A. A bimodality test of the separation indicated by the solid 
line yielded P-value < 10" 4 . Each of these two groups has its own trend between the ER level 
and age. The solid line can be approximated by ER = 0.1 (age - 42.5). Patients having values 
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above the solid line are referred to as the "ER/AGE high" group, and the patients below the 
line as the "ER/AGE low" group. 
[00230] Prognosis in each group: 

[0023 1] Feature selection and performance evaluation : For the prognosis in each group, non- 
informative genes were filtered in each group of patients. Specifically, only genes with 
|logio(ratio)| > logi 0 (2) and P-value (for log(ratio) + 0) < 0.01 in more than 3 experiments 
were kept. This step removed all genes that never had any significant change across all 
samples. The second step used a leave-one-out cross validation (LOOCV) procedure to 
optimize the number of reporter genes (features) in the classifier and to estimate the 
performance of the classifier in each group. The feature selection was included inside the 
loop of each LOOCV process. The final "optimal" reporter genes were selected using all of 
the "training samples" as the result of "re-substitution" because one classifier was needed for 
each group. 

[00232] Selection of training samples : Only the samples from patients who had metastases 
within 5 years of initial diagnosis (3 years for "ER~ sporadic" samples; i.e., the "poor 
outcome" group), or who were metastases-free with more than 5 years of follow-up time (i.e., 
the "good outcome" group, were used as the training set. Because the average expression 
levels for informative genes among patients who were metastasis-free, or who had early 
metastases, were used as expression templates for prediction, the training samples for the 
ER+ samples were further limited to those samples that could also be correctly classified by 
the first round of LOOCV process. For the "ER~, sporadic" samples, no such iteration was 
done because no improvement was observed. For the "ER~, BRCA1" samples, an iteration 
was done, but the training samples in the second iteration were limited to the correctly 
predicted good outcome samples from the first round of LOOCV, and all the poor outcome 
samples with metastases time less than 5 years. Further limitation of the poor outcome 
samples was not performed because of the small number of poor samples and the absence of 
improvement by such limitation. In the first round of LOOCV, except for the "ER~, 
sporadic" group, the number of features was fixed at 50 genes. A patient was predicted to 
have a favorable outcome, that is, no metastases within five years of initial diagnosis, if the 
expression of the reporter genes in a sample from the individual was more similar to the 
"average good profile" than the "average poor profile", and a poor outcome, that is, a 
metastasis within five years, if the expression of the reporter genes in the sample was more 
similar to the "average poor profile" than the "average good profile". 
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[00233] The justification for such an iteration operation is threefold. First, biologically, there 
are always a few individuals with specific reasons (different from the vast majority) to stay 
metastases free or to develop metastases. Second, statistically, most groups of patients 
include outliers that don't follow the distribution of the majority of samples. Third, 
methodologically, the iteration operation is very similar to the idea of "boosting", but instead 
of increasing the weights of the samples predicted wrong, emphasis is placed on the well 
behaved samples for selecting features and training the classifier. Since this process was 
used to select "training samples", and the performance was evaluated using the LOOCV 
(including the feature selection) after the training sample being fixed, there is no issue of 
over-fitting involved in our procedures. This method of iteration is thus more likely to reveal 
the dominant mode to metastases within each group. 

[00234] Error rate and odds ratio, threshold in the final LOOCV : Unless otherwise stated, 
the error rate was the average error rate from two populations: (1) the number of poor 
outcome samples misclassified as good outcome samples, divided by the total number of poor 
outcome samples; and (2) the total number of good outcome samples misclassified as poor 
outcome samples, divided by the total number of good samples. Two odds ratios were 
reported for a given threshold: (1) the overall odds ratio and (2) the 5 year odds ratio. The 5 
year odds ratio was calculated from samples from individuals that were metastases free for 
more than five years, and who experienced metastasis within 5 years. The threshold was 
applied to corl — cor2, where "corl" stands for the correlation to the "average good profile" 
in the training set, and "cor2" stands for the correlation to the "average poor profile" in the 
training set. 

[00235] The threshold in the final round of LOOCV was defined using the following steps: 
(1) For each of the N sample i left out for training, features based on the training set were 
selected, (2) given a feature set, an incomplete LOOCV with N-l samples was performed 
(only the "average poor profile" and "average good profile" is varied depending on whether 
the left out sample is in the training set or not), (3) the threshold based on the minimum error 
rate from N-l samples was determined, and that threshold was assigned to sample i in step 
(1), (4) the median threshold from all N samples was taken, and designated the final 
threshold. FIGS. 3-7 present detailed information about classifiers for the 5 groups: "ER~ 
sporadic", "ER", BRCA1", "ER+, ER/age high", "ER+, ER/age low, LIST", "ER+, ER/age 
low, LN+". Tables 1-5 {see Section 5.3) list the final optimal reporter genes for each of the 5 
classifiers for each of the five patient subsets. Table 6, below, summarizes the performance 
of each of the five classifiers together with thresholds used in each classifier. 
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[00236] Table 6. Performance of classifiers for each patient subset. 



Classifier 


Optimal # 
of Genes 


(C1-C2) 
Threshold 


Metastasis 
Free 


#of 

WCll I l|JICr9 


TP 


FP 


FN 


TN 


Odds 

Rat in 


95% C.I. 


ER+, ER/AGE 
j high 


50 


1.22 


Overall 


83 


31 


14 


5 


33 


14.61 


4.71- 
45.36 


5 year 


71 


24 


11 


3 


33 


24.00 


6.03- 1 
95.46 


ER+, ER/AGE 
low, LN- 


65 


0.38 


Overall 


81 


14 


6 


6 


55 


21.39 


5.98- 
76.52 


5 year 


73 


11 


4 


5 


53 


29.15 


6.73- 
126.33 


ER+, ER/AGE 
low, LN+ 


50 


-0.12 


Overall 


56 


7 


4 


6 


39 


11.38 


2.54- 
50.94 


5 year 


48 


5 


4 


3 


36 


15.00 


2.57- 
87.64 


ER-, sporadic 


20 


-0.01 


Overall 


52 


18 


7 


7 


29 


7.35 


2.16- 
25.04 


5 year 


45 


16 


5 


6 


18 


9.60 


2.45- 
37.58 


ER-, BRCA1 


10 


-0.37 


Overall 


34 


6 


3 


3 


22 


14.67 


2.34- 
92.11 


5 year 


22 


6 


1 


3 


12 


24.00 


2.04- 
282.68 



[00237] TP: True positive 



[00238] FP: False positive 
[00239] FN: False negative 
[00240] TN: True negative 

[00241] Classification method: All classifiers described herein, feature selection and 
optimization were included inside the LOOCV loop. Classifier performance was based on 
the LOOCV results. The profile based on the selected features from each patient was 
compared to the "average good profile" and "average poor profile" (by correlation) to 
determine its predicted outcome. 

[00242] Correlation calculation: The correlation between each gene's expression log(ratio) 
and the endpoint data (final outcome) was calculated using the Pearson's correlation 
coefficient. The correlation between each patient's profile and the "average good profile" 
and "average poor profile" was the cosine product (no mean subtraction). 
[00243] Results: 

[00244] The comprehensive prognosis strategy was employed on microarray expression 
profiles of 31 1 patients diagnosed before age 55 that were all part of previous studies 
establishing and validating a 70-gene prognosis profile. See van 't Veer et aL, Nature 
415:530 (2002); van de Vijver et aL, N. Engl X Med. 347:1999 (2002). In addition, 14 
known BRCA1 samples from the Nature study were included in defining the prognosis 
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classifier for the BRCA1 group. The overview of the stratifications is shown in FIG. 1. In 
each of the patient subsets, prognosis classifiers were developed and performance was 
evaluated by leave-one-out cross-validation. The biological make up of each of the 
classifiers was also examined. 

[00245] During the process to decide whether a particular clinical parameter should be used 
for the next stratification, our objectives were twofold: (1) identification of homogeneous 
prognosis patterns; and/or (2) improved prognosis in the subsets. There is a subtle balance 
between these two objectives because smaller groups will likely lead to uniform patterns 
within the group but have increasingly limited predictive power. With the exception of the 
BRCA1 subset, each group in our stratification contained 50 or more samples. 
[00246] The first layer of stratification was based on the estrogen receptor level. It was 
previously observed that estrogen receptor expression has a dominant effect on overall gene 
expression in breast cancer as seen in hierarchical clustering, van f t Veer et al, Nature 
415:530 (2002); Perou et al, Nature 406:747 (2000); Gruvberger et aL, Cancer Res. 61 :5979 
(2001). In previous analysis up to 2500 genes were significantly correlated with ER 
expression levels in tumor. See, van f t Veer et aL, Nature 415:530 (2002). According to the 
threshold defined previously (van de Vijver et al. y N Engl. J. Med. 347:1999 (2002)), 
samples were first divided into two groups according to the estrogen receptor level as 
measured by the oligo probe (accession number: NM_000125) on the array; samples with 
log(ratio) > -0.65 belong to the ER+ group, and the rest belong to ER" group). This resulted 
in 239 samples in the ER+ group and 72 samples in the ER~ group. 

[00247] In the ER+ branch it was observed that when displaying ER expression level as a 
function of age, at least two subgroups appeared to exist. (In general, any bimodality in the 
clinical data is useful.) The tumors were stratified according this bimodality (see FIG. 2). 
The group of ER+ patients having a high ER/AGE ratio was designated the "ER/AGE high" 
group (83 samples), and the remaining group of patients was designated "ER/AGE low" 
group (156 samples). 

[00248] Within the "ER/age high" group, a group of prognosis reporter genes that highly 
correlated with the outcome is identified (see Table 3). Moreover, the expression of these 
genes appeared to be very homogeneous, as indicated by high similarity in expression among 
those genes. See FIG 2A. Leave-one-out cross validation including reporter selection 
yielded an odds ratio of 14.6 (95%CI: 4.7-45.4) and 5 year odds ratio of 24.0 (95%CI: 6.0- 
95.5). Examination of those reporter genes reveals they are mostly the cell cycle genes which 
are highly expressed in the poor outcome tumors. It is worth noting that even though this 
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group includes LN+ and LN- individuals, and mixed treatment, the incidence of distant 
metastases is predicted by a biologically uniform set of genes, possibly indicating that 
proliferation is the prime driving force for disease progression. Also even though variation in 
these genes is observed in other tumor subgroups this is generally not correlated with 
outcome in those settings (see below). 

[00249] In the "ER/age low" group, no predictive pattern was found in the whole group; thus, 
the samples were further stratified into LN- (81 samples, referred to as "ER/age low LN ") 
and LN+ (75 samples, referred to as "ER/age low LN+") group. 

[00250] Within the "ER/age low LN~" group, a group of genes was identified that was 
uniformly co-regulated, and which correlated with the outcome. Leave-one-out cross- 
validation (including feature selection) yielded an odds ratio of 21.4 (95% CI: 6.0-76.5) and 5 
year odds ratio of 29.2 (95% CI: 6.7-126.3). This group of genes is also enriched for 
individual biological functions (see below). 

[00251] For the "ER/age low LN+" subset, an informative set of genes (see Table 4) was 
obtained after exclusion of several samples from older individuals having low ER levels. 
These samples are indicated in FIG. 2A as those lying below the dashed line (approximated 
as ER < 0.1*(age-50). 56 samples remained after the exclusion. This sample set allowed the 
identification of a group of genes with a highly homogeneous pattern that is useful for 
prognosis (overall odds ratio: 11.4 (2.5-50.9), 5 year odds ratio: 15.0 (2.6-87.6)). This 
suggests again that ER vs. age is an important combination for stratifying breast cancer 
patients. The reporter genes involved in this classifier also correlated with the clinical 
measure of the degree of lymphocytic infiltration (data not shown). The prediction in this 
group was not as strong as other positive groups, which may indicate the primary tumor 
carries weaker information about the metastases for this group of patients, and the metastases 
may be started from or influenced by tumors already in lymph nodes. 

[00252] In the ER" branch, because a portion of the samples are "BRCA1 -like," it is natural 
to divide the samples into "BRCAl-like" and "sporadic like". To perform the classification, 
the BRCA1 /sporadic tumor type classifier described in Roberts et aL, "Diagnosis and 
Prognosis of Breast Cancer Patients," International Publication No. WO 02/103320, which is 
hereby incorporated by reference in its entirety, to segregate the ER~ cohort samples. 52 out 
of the 72 ER~ samples were found to be "sporadic like" and 20 were found to be "BRCA1- 
like". Interestingly, the "sporadic like" group was enriched for erbb2 mutations (data not 
shown). 
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[00253] Within the "ER~, sporadic" group, no homogeneous prognosis pattern was 
identified; however, 20 genes were identified that are highly predictive of the tumor outcome 
{see Table 2). Leave-one-out cross-validation including feature selection yielded an odds 
ratio of 7.4 (95% CI 2.2-25.0) and 5 year odds ratio 9.6 (2.5 - 37.6). This result represents a 
significant improvement in prognosis compared to the previously-identified 70 gene 
prognosis classifier {see Roberts et aL, International Publication No. WO 02/103320; van f t 
Veer et al, Nature 415:530 (2002)) which has no within-group prognostic power for the ER - 
patient subset. The fact that 20 genes predict outcome and that there is no homogeneous (and 
apparent biological) pattern in this group probably indicates multiple mechanisms of 
metastasis in this group. Gene annotation indicates that genes included may be involved in 
invasion, energy metabolism and other functions. 

[00254] For the "ER~, BRCA1 -like" group, we added 14 BRCA1 mutation carrier samples 
from a previous study were added to increase the number of samples. Those 14 extra 
samples also satisfied the following selection criteria: ER negative and age less than 55 years. 
The leave-one-out cross validation process identified 10 genes that are predictive of final 
outcomes. The overall odds ratio is 14.7 (95% CI: 2.3-92.1) and the 5 year odds ratio is 24.0 
(95% CI: 2.0-282.7). 

[00255] Because no homogeneous gene expression patterns were found in ER" branch, the 
predictive power of those genes was further validated. One means of further validation was 
to review the different classifier gene sets for biological interpretations and to identify genes 
within each classifier that gave indications as to the origins of the tumors. 
[00256] The "ER+, ER/AGE high" group yielded a classifier highly enriched for cell cycle 
genes with both Gl/S and G2/M phases represented. In this group, over-expression of 46 of 
the 50 genes was associated with disease progression including all the known cell cycle 
genes. This is consistent with rapid growth being the determinant of metastatic potential. 
Four genes in this classifier were anti-correlated with outcome and cell cycle. One of these 
genes encodes follistatin, which binds to and inhibits activin and other members of the TGFp 
family (Lin et aL 9 Reproduction 126:133 (2003)), the members of which have many 
functions, including growth stimulation. Tumor grade also accurately predicted metastatic 
potential in this group (overall odds ratio: 5.9, 95% CI: 2.0-18.0, 5 year odds ratio: 12.5, 95% 
CI: 2.6-59.3) and was also correlated with the expression level of these genes, which is 
consistent with rate of growth being the primary determinant of disease progression. This set 
of genes had a significantly lower correlation with outcome in the other patient subsets, even 
though coordinate and similarly variable expression was seen. For example, many tumors in 
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the "ER~ sporadic" group had high cell cycle and low FST expression, but the expression of 
these genes in these groups was minimally correlated with outcome, indicating that growth 
was not the primary determinant of outcome here (see FIGS. 8 A and 8B). 
[00257] The ER+, ER/AGE low, LN~ group yielded a classifier rich in both genes for 
glycolytic enzymes (12 of 56) and genes induced by hypoxia and/or angiogenesis (14 of 56) 
with 5 genes falling into both categories. These genes were positively correlated with poor 
outcome, implying that energy metabolism (glycolysis), angiogenesis and adaptation to 
hypoxia were critical pathways in this subgroup of tumors. None of these genes appeared in 
the classifiers for the other patient subsets, and there was a much reduced predictive value of 
these genes in the other tumors, even though coordinate and similarly variable expression was 
seen (see FIG. 8C and 8D). 

[00258] The implication of the above analyses is that certain well known functions (growth, 
angiogenesis, energy metabolism) are important in certain tumor types and not in others, and 
therefore therapies that target these functions will be likely be similarly effective in some 
tumor subgroups and not in others. For example therapies that target cell cycle progression, 
such as taxol or the vinca alkaloids, may be optimally effective in the ER+, ER/AGE high 
group, where overexpression of cell cycle genes predominates in the classifier. In contrast, 
tumor subgroups in which variation in cell cycle expression is not correlated with outcome 
may be less sensitive to taxol or the vinca alkaloids. 

[00259] The "comprehensive prognosis" approach significantly improved the prediction error 
rate when compared with 70 gene classifier (Table 7). To make the comparison fair, we 
listed two sets of results from the 70 gene classifier. The first results from the use of the 
same threshold applied to all the patient subsets (threshold previously optimized for false 
negative rate); the second one results from the use of a threshold optimized for each patient 
subset (optimized for average error rate). The comprehensive approach lowered the error rate 
by at least 6%. 

Table 7. Average error rate for the patient subset approach compared with the previously- 



described 70 gene classifier. 



Prognosis method 


over all error rate 


5 year error rate 


70 gene, fix thresh 


30.90% 


25.70% 


70 gene, opt thresh 


28.60% 


27.60% 


Comprehensive 


21.50% 


19.30% 



[00260] Fix thresh: use of a fixed threshold in the classifier as previously determined. 
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[00261] Opt threshold: use of a threshold optimized for each sub-group. For the "ER/Age 
low, LN+" subgroup, 56 samples used for developing the classifier were included here, 
resulted in 306 samples in total. 
[00262] 

7. REFERENCES CITED 

[00263] All references cited herein are incorporated herein by reference in their entirety and 
for all purposes to the same extent as if each individual publication or patent or patent 
application was specifically and individually indicated to be incorporated by reference in its 
entirety for all purposes. 

[00264] Many modifications and variations of the present invention can be made without 
departing from its spirit and scope, as will be apparent to those skilled in the art. The specific 
embodiments described herein are offered by way of example only, and the invention is to be 
limited only by the terms of the appended claims along with the full scope of equivalents to 
which such claims are entitled. 
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WHAT IS CLAIMED IS : 

1 . A method of identifying a set of informative genes or markers for a 
condition comprising a plurality of phenotypic or genotypic characteristics, comprising: 

(a) classifying each of a plurality of samples or individuals on the basis of 
one or more phenotypic or genotypic characteristics of said condition into a plurality 
of first classes; and 

(b) identifying within each of said first classes a first set of genes or 
markers informative for said condition 

wherein said first set of genes or markers within each of said first classes is unique to said 
class relative to other first classes. 

2. The method of claim 1, which further comprises additionally classifying into a 
plurality of second classes said samples or individuals in at least one of said first classes on 
the basis of a phenotypic or genotypic characteristic different that that used in said classifying 
step (a); and identifying within at least one of said second classes a second set of informative 
genes or markers, wherein said second set of informative genes or markers within each of 
said second classes is unique to said second class relative to other first and second classes. 

3. A method of identifying a set of informative genes or markers for a condition 
comprising a plurality of phenotypic or genotypic characteristics, comprising: 

(a) classifying each of a plurality of samples or individuals on the basis of 
one or more phenotypic or genotypic characteristics into a plurality of first classes; 

(b) classifying at least one of said first classes into a plurality of second 
classes on the basis of phenotypic or genotypic characteristic different than that used 
in said classifying step (a); and 

(c) identifying within at least one of said first classes or said second 
classes a set of genes or markers informative for said condition, 

wherein said second set of genes or markers is unique to said class relative to other first and 
second classes. 
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4. A method of identifying a set of informative genes or markers for a condition 
comprising a plurality of phenotypic or genotypic characteristics, comprising: 

(a) selecting a first characteristic from said plurality of phenotypic or 
genotypic characteristics; 

(b) identifying at least two first condition classes differentiable by said 
first characteristic; 

(c) selecting a plurality of individuals classifiable into at least one of said 
first condition classes; and 

(d) identifying in samples derived from each of said plurality of 
individuals a set of genes or markers informative for said condition within said at least 
one of said first condition classes. 

5. A method of classifying an individual with a condition as having a good 
prognosis or a poor prognosis, comprising: 

(a) classifying said individual into one of a plurality of patient classes, said 
patient classes being differentiated by one or more phenotypic, genotypic or clinical 
characteristics of said condition; 

(b) determining the level of expression of a plurality of genes or their 
encoded proteins in a cell sample taken from the individual relative to a control, said 
plurality of genes or their encoded proteins comprising genes or their encoded 
proteins informative for prognosis of the patient class into which said individual is 
classified; and 

(c) classifying said individual as having a good prognosis or a poor 
prognosis on the basis of said level of expression. 

6. The method of claim 5, wherein said condition is cancer, said good prognosis 
is the non-occurrence of metastases within five years of initial diagnosis, and said poor 
prognosis is the occurrence of metastases within five years of initial diagnosis. 
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7. The method of claim 5, wherein said control is the average level of expression 
of each of said plurality of genes or their encoded proteins across a plurality of samples 
derived from individuals identified as having a poor prognosis. 

8. The method of claim 7, in which said classifying step (c) is carried out by a 
method comprising comparing the level of expression of each of said plurality of genes or 
their encoded proteins to said average level of expression of each corresponding gene or its 
encoded protein in said control, and classifying said individual as having a poor prognosis if 
said level of expression correlates with said average level of expression of each of said genes 
or their encoded proteins in said control more strongly than would be expected by chance. 

9. The method of claim 5, wherein said control is the average level of expression 
of each of said plurality of genes or their encoded proteins across a plurality of samples 
derived from individuals identified as having a good prognosis. 

10. The method of claim 9, in which said classifying in step (c) is carried out by a 
method comprising comparing the level expression of each of said plurality of genes or their 
encoded proteins to said average level of expression of each corresponding gene or its 
encoded protein in said control, and classifying said individual as having a good prognosis if 
said level of expression correlates with said average level of expression of each of said genes 
or their encoded proteins in said control more strongly than would be expected by chance. 

1 1 . The method of claim 5, wherein said plurality of patient classes comprises 
ER~ BRCAI individuals; EFT, sporadic individuals; ER+, ER/AGE high individuals; ER+, 
ER/AGE low, LN4- individuals; and ER+, ER/AGE low, LN~ individuals. 

12. A method of classifying a breast cancer patient as having a good prognosis or 
a poor prognosis comprising: 

(a) classifying said breast cancer patient as ER~~, BRCAI; ER" sporadic; 
ER+, ER/AGE high; ER+, ER/AGE low, LN+; or ER+, ER/AGE low, LN~; 

(b) determining the level of expression of a first plurality of genes in a cell 
sample taken from said breast cancer patient relative to a control, said first plurality of 
genes comprising two of the genes corresponding to the markers in Table 1 if said 
breast cancer patient is classified as ER~ BRCAI; in Table 2 if said breast cancer 
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patient is classified as ER~ sporadic; in Table 3 if said breast cancer patient is 
classified as ER+, ER/AGE high; in Table 4 if said breast cancer patient is classified 
as ER-h ER/AGE low, LN+; or in Table 5 if said breast cancer patient is classified as 
ER+, ER/AGE low, LN~; and 

(c) classifying said breast cancer patient as having a good prognosis or a 
poor prognosis on the basis of the level of expression of said first plurality of genes, 

wherein said breast cancer patient is "ER/AGE high 9 ' if the ratio of the logi 0 (ratio) of ER 
gene expression to age exceeds a predetermined value, and "ER/AGE low" if the ratio of the 
log!o(ratio) of ER gene expression to age does not exceed said predetermined value. 

13. The method of claim 12, wherein said control is the average level of 
expression of each of said plurality of genes in a plurality of samples derived from ER", 
BRCA1 individuals, if said breast cancer patient is ER", BRCA1; the average level of 
expression of each of said plurality of genes in a plurality of samples derived from ER", 
sporadic individuals if said breast cancer patient is ER", sporadic; the average level of 
expression of each of said plurality of genes in a plurality of samples derived from ER+, 
ER/AGE high individuals, if said breast cancer patient is ER+, ER/AGE high; the average 
level of expression of each of said plurality of genes in a plurality of samples derived from 
ER+, ER/AGE low, LN+ individuals where said breast cancer patient is ER+, ER/AGE low, 
LN+; or the average level of expression of each of said plurality of genes in a plurality of 
samples derived from ER+, ER/AGE low, LN" individuals where said breast cancer patient is 
ER+, ER/AGE low, LN". 

14. The method of claim 13, wherein each of said individuals has a poor 
prognosis. 

15. The method of claim 13, wherein each of said individuals has a good 
prognosis. 

16. The method of claim 14, wherein said classifying step (c) is carried out by a 
method comprising comparing the level of expression of each of said plurality of genes or 
their encoded proteins in a sample from said breast cancer patient to said control, and 
classifying said breast cancer patient as having a poor prognosis if said level of expression 
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correlates with said average level of expression of the corresponding genes or their encoded 
proteins in said control more strongly than would be expected by chance. 

1 7. The method of claim 1 2, wherein said predetermined value of ER is calculated 
as ER = 0.1 (AGE - 42.5), wherein AGE is the age of said individual. 

1 8. The method of claim 12, wherein said individual is ER", BRCA1, and said 
plurality of genes comprises two of the genes for which markers are listed in Table 1. 

19. The method of claim 12, wherein said individual is ER", BRCA1, and said 
plurality of genes comprises all of the genes for which markers are listed in Table 1. 

20. The method of claim 12, wherein said individual is ER", sporadic, and said 
plurality of genes comprises two of the genes for which markers are listed in Table 2. 

21 . The method of claim 12, wherein said individual is ER", sporadic, and said 
plurality of genes comprises all of the genes for which markers are listed in Table 2. 

22. The method of claim 12, wherein said individual is ER+, ER/AGE high, and 
said plurality of genes comprises two of the genes for which markers are listed in Table 3. 

23. The method of claim 12, wherein said individual is ER+, ER/AGE high, and 
said plurality of genes comprises all of the genes for which markers are listed in Table 3. 

24. The method of claim 12, wherein said individual is ER+, ER/AGE low, LN+, 
and said plurality of genes comprises two of the genes for which markers are listed in Table 
4. 

25. The method of claim 12, wherein said individual is ER+, ER/AGE low, LN+, 
and said plurality of genes comprises all of the genes for which markers are listed in Table 4. 

26. The method of claim 12, wherein said individual is ER+, ER/AGE low, LN", 
and said plurality of genes comprises two of the genes for which markers are listed in Table 
4. 

27. The method of claim 12, wherein said individual is ER+, ER/AGE low, LN", 
and said plurality of genes comprises all of the genes for which markers are listed in Table 4. 
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28. The method of claim 12, further comprising determining in said cell sample 
the level of expression, relative to a control, of a second plurality of genes for which markers 
are not found in Tables 1-5, wherein said second plurality of genes is informative for 
prognosis. 

29. A method for assigning an individual to one of a plurality of categories in a 
clinical trial, comprising: 

(a) classifying said individual as ERT, BRCA1, ER", sporadic; ER+, 
ER/AGE high; ER+, ER/AGE low, LN+; or ER+, ER/AGE low, LN~; 

(b) determining for said individual the level of expression of at least two 
genes for which markers are listed in Table 1 if said individual is classified as ER~, 
BRCA1; Table 2 if said individual is classified as ER" sporadic; Table 3 if said 
individual is classified as ER+, ER/AGE high; Table 4 if said individual is classified 
as ER+, ER/AGE low, LN+; or Table 5 if said individual is classified as ER+, 
ER/AGE low, LIST; 

(c) determining whether said individual has a pattern of expression of said 
at least two genes that correlates with a good prognosis or a poor prognosis; and 

(d) assigning said individual to one category in a clinical trial if said 
individual has a good prognosis, and assigning said individual to a second category in 
said clinical trial if said individual has a poor prognosis. 

30. The method of claim 29, wherein said individual is additionally assigned to a 
category in said clinical trial on the basis of the classification of said individual as determined 
in step (a). 

3 1 . The method of claim 29, wherein said individual is additionally assigned to a 
category in said clinical trial on the basis of any other clinical, phenotypic or genotypic 
characteristic of breast cancer. 

32. The method of claim 29, further comprising determining in said cell sample 
the level of expression, relative to a control, of a second plurality of genes for which markers 
are not found in Tables 1-5, wherein said second plurality of genes is informative for 
prognosis of breast cancer, and determining from the expression of said second plurality of 
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genes, in addition to said first plurality of genes, whether said individual has a good 
prognosis or a poor prognosis. 

33. A method of identifying a set of genes informative for a condition, said 
condition having a plurality of phenotypic or genotypic characteristics such that samples may 
be categorized by at least one of said phenotypic or genotypic characteristics into at least one 
characteristic class, said method comprising: 

(a) selecting a plurality of samples from individuals having said condition; 

(b) identifying a first set of genes informative for said characteristic class 
using said plurality of samples; 

(c) predicting the characteristic class of each of said plurality of samples; 

(d) discarding samples for which said characteristic class is incorrectly 
predicted; 

(e) repeating steps (c) and (d) at least once; and 

(f) identifying a second set of genes informative for said characteristic 
class using samples in said plurality of samples remaining after step (e). 

34. The method of claim 6, wherein said cancer is breast cancer. 

35. A method for assigning an individual to one of a plurality of categories in a 
clinical trial, comprising: 

(a) classifying the individual into one of a plurality of condition categories 
differentiated by at least one genotypic or phenotypic characteristic of the condition; 

(b) determining the level of expression, in a sample derived from said 
individual, of a plurality of genes informative for said condition category; 

(c) determining whether said level of expression of said plurality of genes 
indicates that the individual has a good prognosis or a poor prognosis; and 

(d) assigning the individual to a category in a clinical trial on the basis of 
prognosis. 
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36. A method for identifying one or more sets of informative genes or markers for a 
condition in an organism, comprising: 

(a) subdividing a plurality of individuals or samples derived therefrom of said 
organism subject to said condition into a plurality of classes based on one or more clinical, 
phenotypic or genotypic characteristics of said organism, wherein each said class consists of 
a plurality of individuals or samples derived therefrom of said organism each having said one 
or more clinical, phenotypic or genotypic characteristics specific for said class; and 

(b) attempting to identify for each of one or more of said plurality of classes a set of 
genes or markers informative for said condition in individuals in said class, 

wherein, if a set of genes or markers informative for said condition in individuals in said class 
is obtained for any of said one or more of said plurality of classes, said set of genes or 
markers is taken as a set of informative genes or markers for said condition in said organism. 

37. The method of claim 36, further comprising, for each of one or more of said 
classes in which a set of genes or markers informative for said condition in individuals in said 
class cannot be obtained, repeating said steps (a) and (b) on said plurality of individuals or 
samples derived therefrom in said class such that said plurality of individuals or samples 
derived therefrom in said class is subdivided into a plurality of additional classes based on 
one or more clinical, phenotypic or genotypic characteristics of said organism which are 
different from those used for defining said class, wherein, for each of said plurality of 
additional classes, if a set of genes or markers informative for said condition in individuals in 
said class is obtained, said set of genes or markers is taken as a set of informative genes or 
markers for said condition in said organism. 

38. A method for identifying one or more sets of informative genes or markers for a 
condition in an organism, comprising: 

(a) subdividing a plurality of individuals or samples derived therefrom of said 
organism subject to said condition into a plurality of classes based on one or more clinical, 
phenotypic or genotypic characteristics of said organism, wherein each said class consists of 
a plurality of individuals or samples derived therefrom of said organism each having said one 
or more clinical, phenotypic or genotypic characteristics specific for said class; 
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(b) attempting to identify for each of one or more of said plurality of classes a set of 
genes or markers informative for said condition in individuals in said class, wherein if a set of 
genes or markers informative for said condition in individuals in said class is identified for 
any of said one or more of said classes, said set of genes or markers is taken as a set of 
informative genes or markers for a condition in said organism; and 

(c) for each of one or more of said classes in which a set of genes or markers 
informative for said condition in individuals in said class cannot be obtained, repeating said 
steps (a) and (b) on said plurality of individuals or samples derived therefrom in said class 
such that said plurality of samples or individuals in said class is subdivided into a plurality of 
additional classes based on one or more clinical, phenotypic or genotypic characteristics of 
said organism which are different from those used those used for defining said class, wherein, 
for each of one or more of said plurality of additional classes, if a set of genes or markers 
informative for said condition in individuals in said class is obtained, said set of genes or 
markers is taken as a set of informative genes or markers for a condition in said organism. 

39. The method of claim 38, wherein said condition is a type of cancer, and wherein 
each of said sets of genes or markers is informative of prognosis of individuals in a 
corresponding class. 

40. The method of claim 39, wherein said condition is breast cancer, and wherein said 
one or more clinical, phenotypic or genotypic characteristics comprises age, ER level, 
ER/AGE, BRAC1 status, and lymph node status. 

41. The method of claim 39, further comprising generating a template profile 
comprising measurements of levels of genes or markers of said set for said class 
representative of levels of the genes or markers in a plurality of patients having a chosen 
prognosis level. 

42. A method for predicting a breast cancer patient as having a good prognosis or a 
poor prognosis, comprising: 

(a) classifying said breast cancer patient into one of the following classes: (al) ER , 
BRCAI; (a2) ER", sporadic; (a3) ER+, ER/AGE high; (a4) ER+, ER/AGE low, LN+; or (a5) 
ER+, ER/AGE low, LN~; 
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(b) deter minin g a profile comprising measurements of a plurality of genes or markers 
in a cell sample taken from said breast cancer patient, said plurality of genes markers 
comprising at least two of the genes or markers corresponding to the markers in (bl) Table 1 
if said breast cancer patient is classified as ER" BRCAI; (b2) Table 2 if said breast cancer 
patient is classified as ER" sporadic; (b3) Table 3 if said breast cancer patient is classified as 
ER+, ER/AGE high; (b4) Table 4 if said breast cancer patient is classified as ER+, ER/AGE 
low, LN+; or (b5) Table 5 if said breast cancer patient is classified as ER+, ER/AGE low, 
LIST; and 

(c) classifying said breast cancer patient as having a good prognosis or a poor 
prognosis based on said profile of said plurality of genes or markers, 

wherein ER + designates a high ER level and ER' designates a low ER level, wherein said 
ER/AGE is a metric of said ER level relative to the age of said patient, and wherein LN+ 
designates a greater than 0 lymph nodes status in said patient and LN" designates a 0 lymph 
nodes status in said patient. 

43. The method of claim 42, wherein step (c) is carried out by a method comprising 
comparing said profile to a good prognosis template and/or a poor prognosis template, and 
wherein said patient is classified as having a good prognosis if said profile has a high 
similarity to a good prognosis template or has a low similarity to a poor prognosis template or 
as having a poor prognosis if said profile has a low similarity to a good prognosis template or 
has a high similarity to a poor prognosis template, said good prognosis template comprising 
measurements of said plurality of genes or markers representative of levels of said genes or 
markers in a plurality of good outcome patients and said poor prognosis template comprising 
measurements of said plurality of genes or markers representative of levels of said genes or 
markers in a plurality of poor outcome patients, wherein a good outcome patient is a breast 
cancer patient who has non-reoccurrence of metastases within a first period of time after 
initial diagnosis and a poor outcome patient is a patient who has reoccurrence of metastases 
within a second period of time after initial diagnosis. 

44. The method of claim 43, further comprising determining said profile, said ER 
level, said LN status, and/or, said ER/AGE. 

45. The method of claim 44, wherein said profile is an expression profile comprising 
measurements of a plurality of transcripts in a sample derived from said patient, wherein said 
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good prognosis template comprises measurements of said plurality of transcripts 
representative of expression levels of said transcripts in said plurality of good outcome 
patients, and wherein said poor prognosis template comprises measurements of said plurality 
of transcripts representative of expression levels of said transcripts in said plurality of poor 
outcome patients. 

46. The method of claim 45, wherein said expression profile is a differential 
expression profile comprising differential measurements of said plurality of transcripts in said 
sample derived from said patient versus measurements of said plurality of transcripts in a 
control sample. 

47. The method of claim 43, wherein said profile comprises measurements of a 
plurality of protein species in a sample derived from said patient, wherein said good 
prognosis template comprises measurements of said plurality of protein species representative 
of levels of said protein species in said plurality of good outcome patients, and wherein said 
poor prognosis template comprises measurements of said plurality of protein species 
representative of levels of said protein species in said plurality of poor outcome patients. 

48. The method of claim 46, wherein measurement of each said transcript in said good 
prognosis template is an average of expression levels of said transcript in said plurality of 
good outcome patients. 

49. The method of claim 48, wherein similarity of said expression profile to said good 
prognosis template is represented by a correlation coefficient between said expression profile 
and said good prognosis template, wherein said correlation coefficient greater than a 
correlation threshold indicates a high similarity and said correlation coefficient equal to or 
less than said correlation threshold indicates a low similarity. 

50. The method of claim 48, wherein similarity of said expression profile to said good 
prognosis template is represented by a distance between said cellular constituent profile and 
said good prognosis template, wherein said distance less than a given value indicates a high 
similarity and said distance equal to or greater than said given value indicates a low 
similarity. 

5 1 . The method of claim 49, wherein said correlation threshold is 0.5. 
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52. The method of claim 51 , wherein said ER level is determined by measuring an 
expression level of a gene encoding said estrogen receptor in said patient relative to 
expression level of said gene in said control sample, and wherein said ER level is classified 
as ER + if logl0(ratio) of said expression level is greater than -0.65, and wherein said ER level 
is classified as ER" if loglO(ratio) of said expression level is equal to or less than -0.65. 

53. The method of claim 52, wherein said gene encoding said estrogen receptor is the 
estrogen receptor a gene. 

54. The method of claim 53, wherein said ER/AGE is classified as high if said ER 
level is greater than c(AGE — d), and wherein said ER/AGE is classified as low if said ER 
level is equal to or less than c-( AGE - d), wherein c is a coefficient, AGE is the age of said 
patient, and d is an age threshold. 

55. The method of claim 54, wherein said estrogen receptor level is measured by a 
polynucleotide probe that detects a transcript corresponding to the gene having accession 
number NM 000 1 25, wherein said control sample is a pool of breast cancer cells of different 
patients, and wherein c = 0.1 and d = 42.5. 

56. The method of claim 55, wherein said control sample is generated by pooling 
together cDNAs of said plurality of transcripts from a plurality of breast cancer patients. 

57. The method of claim 55, wherein said control sample is generated by pooling 
together synthesized cDNAs of said plurality of transcripts and said transcript of said gene 
encoding said estrogen receptor. 

58. The method of claim 42, wherein said individual is ER", BRCA1, and said 
plurality of genes comprises at least two of the genes for which markers are listed in Table 1 . 

59. The method of claim 42, wherein said individual is ER~, BRCA1, and said 
plurality of genes comprises all of the genes for which markers are listed in Table 1 . 

60. The method of claim 42, wherein said individual is ER~, sporadic, and said 
plurality of genes comprises at least two of the genes for which markers are listed in Table 2. 

61. The method of claim 42, wherein said individual is ER"*, sporadic, and said 
plurality of genes comprises all of the genes for which markers are listed in Table 2. 
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62. The method of claim 42, wherein said individual is ER+, ER/AGE high, and said 
plurality of genes comprises at least two of the genes for which markers are listed in Table 3. 

63. The method of claim 42, wherein said individual is ER+, ER/AGE high, and said 
plurality of genes comprises all of the genes for which markers are listed in Table 3. 

64. The method of claim 42, wherein said individual is ER+, ER/AGE low, LN+, and 
said plurality of genes comprises at least two of the genes for which markers are listed in 
Table 4. 

65. The method of claim 42, wherein said individual is ER+, ER/AGE low, LN+, and 
said plurality of genes comprises all of the genes for which markers are listed in Table 4. 

66. The method of claim 42, wherein said individual is ER+, ER/AGE low, LIST, and 
said plurality of genes comprises at least two of the genes for which markers are listed in 
Table 4. 

67. The method of claim 42, wherein said individual is ER+, ER/AGE low, LIST, and 
said plurality of genes comprises all of the genes for which markers are listed in Table 4. 

68. The method of claim 42, wherein said profile further comprises one or more genes 
for which markers are not found in Tables 1-5, wherein said one or more genes are 
informative for prognosis. 

69. A method for assigning an individual to one of a plurality of categories in a 
clinical trial, comprising assigning said individual to one category in a clinical trial if said 
individual has a good prognosis as determined by the method of any one of claims 7-33, and 
assigning said individual to a second category in said clinical trial if said individual has a 
poor prognosis as determined by the method of any one of claims 7-33. 

70. The method of claim 69, wherein said individual is additionally assigned to a 
category in said clinical trial on the basis of the classification of said individual as determined 
in step (a). 

71. The method of claim 69, wherein said individual is additionally assigned to a 
category in said clinical trial on the basis of one or more other clinical, phenotypic or 
genotypic characteristic of breast cancer. 
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72. The method of claim 69, further comprising determining in said cell sample the 
levels of expression of said one or more genes for which markers are not found in Tables 1-5, 
and determining from said expression levels of said one or more genes, whether said 
individual has a good prognosis or a poor prognosis. 

73. A microarray comprising a plurality of polynucleotide probes each 
complementary and hybridizable to a sequence in a different gene listed in any one of Tables 
1-5. 

74. A microarray comprising a plurality of polynucleotide probes each 
complementary and hybridizable to a sequence in a different gene listed in Table 1 . 

75. The microarray of claim 74, wherein said plurality of polynucleotide probes 
comprises a probe complementary and hybridizable to each of the genes listed in Table 1. 

76. A microarray comprising a plurality of polynucleotide probes each 
complementary and hybridizable to a sequence in a different gene listed in Table 2. 

77. The microarray of claim 76, wherein said plurality of polynucleotide probes 
comprises a probe complementary and hybridizable to a sequence in each of the genes listed 
in Table 2. 

78. A microarray comprising a plurality of polynucleotide probes each 
complementary and hybridizable to a sequence in a different gene listed in Table 3. 

79. The microarray of claim 78, wherein said plurality of polynucleotide probes 
comprises a probe complementary and hybridizable to a sequence in each of the genes listed 
in Table 3. 

80. A microarray comprising a plurality of polynucleotide probes each 
complementary and hybridizable to a sequence in a different gene listed in Table 4. 

81. The microarray of claim 80, wherein said plurality of polynucleotide probes 
comprises a probe complementary and hybridizable to a sequence in each of the genes listed 
in Table 4. 

82. A microarray comprising a plurality of polynucleotide probes each 
complementary and hybridizable to a sequence in a different gene listed in Table 5. 
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83. The microarray of claim 82, wherein said plurality of polynucleotide probes 
comprises a probe complementary and hybridizable to a sequence in each of the genes listed 
in Table 5. 

84. The microarray of any of claims 73-83, wherein said plurality of polynucleotide 
probes constitutes at least 50% of the probes on said microarray. 

85. The microarray of any of claims 73-83, wherein said plurality of polynucleotide 
probes constitutes at least 90% of the probes on said microarray. 

86. The microarray of claim 73, wherein said plurality of polynucleotide probes 
comprises probes complementary and hybridizable to at least 75% of the genes listed in Table 
1, Table 2, Table 3, Table 4, or Table 5, wherein said plurality of polynucleotide probes, in 
total, constitutes at least 50% of the probes on said microarray. 

87. A kit comprising the microarray of any one of claims 73-83 in a sealed container. 
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Sequence Listing 
<110> Rosetta Inpharmatics, LLC 

<120> Classification of Breast Cancer Patients Using a Combination 
of Clinical Criteria and Informative Genesets 



<130> 9301-251-228 

<140> 
<141> 

<150> 60/650,401 
<151> 2005-02-04 

<150> 60/604,076 
<151> 2004-08-24 

<150> 60/550,810 
<151> 2004-03-05 

<160> 366 

<210> 1 

<211> 4946 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> AB032969 
<400> 1 

cagcctcagc ccccagatga agatggggat cacagtgaca aagaagatga acagcctcaa 6 0 

gtggtggttt taaaaaaggg agacctgtca gttgaagaag tcatgaaaat taaagcagaa 12 0 

ataaaggctg ccaaagcaga tgaagaacca actccagccg atggaagaat catatatcga 180 

aaaccagtca agcatccctc agatgaaaaa tattcaggtt taacagcaag ctcaaaaaag 2 40 

aagaagccaa atgaagatga agtaaatcag gactcggtca aaaagaactc acaaaaacaa 3 00 

attaaaaata gtagcctcct ttcttttgac aacgaagatg aaaatgagta agtgtaaata 3 60 

ttttgaattt agtctacttt gaaagtatat ggagtgttca ttaaaatcac attttttcct 42 0 

attataaaga tactacaagt tctttataga aagtttagga aatagagaaa aaaatttaat 480 

aaactacatc tattcatcaa tacccctctg acttaaaatg ccaactctat agaaattagc 540 

tagtattaac attttgttat ttcccttgtg tggttgtata tatatgtaaa ttatattttt 600 

aagcaaaata cattttttgt gtgtaaacaa aattttataa atacaactgt attgcaaatg 660 

ttctttgtcc tgcttctcac ttgacattgc attatgagta ttcttccagg tcagtaaatt 72 0 

tcaaaaacct gacattaata gctacagata atttcataaa catctcattg tatctttttc 780 

attagcaata gctccacttt gggtggggga gatgataatg tgccttgtta aaaatacctc 840 

cccaactcct gctaagggtg gccatgagac tcagctctgg caagttaaga aatacaggtg 900 

gaattctgct tgataaagct gctgggtttt ttgttacaaa aggacagact tggcaaacat 960 

gagcctttgc tcttatcttt tcatcctact tggagtgcag agataaaacc tgagtaccag 102 0 

agccactttt aggcataagg aaggcagcca tgtgctttgg gtcatgttag taaaaagact 1080 

cagagcttgg ctccttgctg acatgcctgg aggagctgct acaccagctt ggattgctga 1140 

cctctgactt cttggtagtg agaagaataa acactgtgct taattaggcc ttggtcaggt 1200 

ttcttttata tgcagccaaa tgcagtccta agtaatacaa taaataactg gtcaaactgt 1260 

tactggtgga gggtgtccag gttcttggca ttttggacaa ataattgaac aaaacgcaca 132 0 

aagcaatgaa tatcctctag aggtttgcca ttggttactt ggcgtacacc ctgtgtaaat 1380 

gaagtagtgg cccgtgacct gtctgattgg tgcagaaagt gaccaatcag aggctgaagt 1440 

gaagttacaa agttatactc ctgtgtaaat gaggacttgg cctatgacca gtctgattgg 1500 

ttgcaggagg ggaccaatca gaggcacttt catttttcat ctgcaatgca gaaaaggcaa 1560 

ggggattgca aagggagtag cctctgatcc ttttgttact taggtatgga gaggtggggt 1620 

tttccttttg attcagttct aggaagtcaa tgtgaatcag ccttaggttc cctgtctcca 1680 
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gaccctattc tcctgcctca ttttccccct gagagacgtg atcctcgtaa atctttatgg 1740 

gaggctgaga gactgagggt ctttcttctg taactgcttc atgctaactt gggacacagt 18 00 

ccctacctat tggagatcac gtaactctca ccctgctfctg tctaggggag acagggtagc 1860 

ttcttgatgg ccggtggtgt cttctcctga aactggctag aaatcttgtc acatgatcat 1920 

ctaacttggt ggtctctagg caaaaggaaa tggatttggt taaaagattt aacagatatg 1980 

gtccaaaaac caaggcaaat ataatcatta ataatgggct ggccaaggga gggagccatg 2 040 

aaacccaact tagtgccctt taggtgcccc agctgttgtc atattttaga ggcccagtca 2100 

gctagttttc aggtggtgtc ccttactaat cctgattggt tgacatcaaa acagcattct 2160 

tcttctagga aaatacataa gccacctgtt tcagcagtta ggagatctag tccccttcga 2 220 

ttttgcaaag cgaccactgc caaggagcct atccgaattt gtaaggtgac aatactttga 2 2 80 

gcaatgttat ccaggctttc cataaaatcc ttggacaagc gttggtaafca ggatagggaa 2340 

gttgcaatcc cgctaactcc cattcctacc tctgctgtta ttcctagccg ttgtgtctgg 2400 

tggttgcagt taaaggtata atgagggatt ggttgttggg agctatatta atttagggac 2 460 

atacaatatt tctgtctcca gtctaccact tccaccaaag acaaatcaca gcagaaccga 2520 

cctaacttca aaataaactg cagtcccata tactgggcct gattacccac acaaagtgca 2 5 80 

acaagaatca ttgtccatat agactctcct agattggctt tgctagaaca tttcacaagg 2 640 

ccatttcagt caaagtcctg agaaagtaac cggtttcaat tgtgccctat tacaaaagaa 2 7 00 

aacgtggtta ttaactttat acagacaaat gccatgaatt aagaatattc ataaatagtt 27 60 

tacaaattct ggagaaatta gaatactcaa tacacttaaa gtgtatttca aggctataaa 2 820 

tagctcaaaa taaaaagatt attcagactc tgaaaaaaca aaaagaagta gcaatatttc 2 880 

aaacaacaaa agccatacaa attatttcag tcttccatta gttcatttca gtccatgtaa 2 9 40 

tcaactcctg ctctacttca tattcatctt tatgaacaca tcagcctttc aattagtgcc 3 000 

ttggaagttt tctgtctaat ccaatggcac actctccaaa gttaccagaa acctgcattc 3 060 

aagagttctt ttcatgaact ccaaagaagt aagccttgga ctgtagctga ttataagtca 312 0 

cttttttttt ttgagaagga tcaaagcaaa acatcaatta tggatgacaa aagtcttaag 3180 

acagccataa agacacagtt gacaaatgtg gctatttctg tggcttacaa caatttaaca 32 40 

taatcattac aacatatatt aagacatatc agaattttag aactctcata caatcctgga 33 00 

acacatatta acaacaaatc tctatcagta taacccaaag gaagctaaac accacctcac 33 60 

acttgacaat gtttcctgta taattcaaac attacaaata agcctaatat aagcctaata 3 42 0 

tgtcactctt gaacthcagg aagcctaata tccaaaaagt tagtttaagg tcaaaagttt 3 480 

ttgaattaac ttttttccat tagtatggtc atatctttct tactaatttg taagttatgt 3540 

aatttatcaa tttttttttg ttgttctgtt tcccaacctc tatgtcagat aaagaatcac 3 600 

ccaggccaga cacagtggct catgcttgtg gtcccagcac tttgggaagc caaggtggga 3 660 

gaattgcttg aagccaggaa tctgagccca gcctgggcga caaagcaata cccctatctc 372 0 

tacaaaaaat aaaaaatagc caggtgtggc gacacacacc tgtggtccca gctgctcggg 37 80 

aggctgagcg ggaggatggc ttgggcccag gggttcaacg ctgcagtgag ctgtgattgc 3 840 

gccactgcac tccagcctgg gcaacagagt aagaactgtc tcaaaaaaaa taaaaaatag 3 900 

aaataaattt taaaaaaaga attacccata ttctctttgt ttttgtttat tcacattaac 3960 

ctttattcta tctggaattt atttgagtat acttttttct caaataatca attgtcctag 402 0 

aaccatgtgt ttctcattta tttgaaaggc catctagtga gagatttctc caaatgttgg 4080 

ggtagggaag ggaggggaag cactttaaag tctgagcctt tagaggtgat tcctcaagac 4140 

cctgcttaat cctaacaatt ttcctcatta gtaaaagtca gcccaaactg ggggcttgtt 42 00 

aagatcctta ccagccacat ccatctgaaa ttatgaattt caaagtatct tacaaatttg 42 60 

gtgccacatt atctttttta agtttgtttt gttttgtttt tttgagacag agtctcgctc 432 0 

tgtcacccgg gctggagtgc agtggcgcga tctcagctca ctgcaagctc cgcctcctgg 43 80 

gttcacacca ttctcttgcc tcggcctccc aagtatctgg gactgcagtc gcccgccacc 444 0 

acgcccggct aatttttttg tatttttaat agagacgggg tttcactttg ttagccagga 450 0 

tggtctcaat ctcctgacct catgatccac ctgcctcggc ctcccaaagt gctgggatta 456 0 

caggcaggag ccaccgcgcc tgggcctttt tttaagtttt aagtacctat aaagaacact 462 0 

gaaaggtgat gtgtgtggat gagctaggaa gacctgaaat aggctctctc taaattaatc 4680 

aaattaatcc tgaagccatt ctgcaatact gtctttaatg tatactcact tgttatagaa 474 0 

gccagggttt tttcccctaa tttgtatcat tgctatatgt gttattgtac caaactacac 4800 

tgttttaatt gctgtaaatt ttaatatgtc ttagtatctg ggtgtgggaa tcttgaaagc 4860 

atggagtttg tgttattcac cactgtattc tcaaatatca gaagagtatc tggcctacta 492 0 
agtgcacaat aaacatagtt aaaatg 4946 

<210> 2 

<211> 60 

<212> DNA 

<213> Homo sapiens 
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<300> 

<308> AB032969 
<400> 2 

taatcctgaa gccattctgc aatactgtct ttaatgtata ctcacttgtt atagaagcca 

<210> 3 

<211> 1007 

<212> DNA 

<213> Homo sapiens 



<300> 

<308> AF005487 



60 
120 



480 
540 
600 
660 
720 



<400> 3 

gaatacagaa tgtgggcaaa ctcgcttctg tgccggccgc cagaaggttt gctgagggca 
atcactccct ggtgccgggc tccttgaggt tatgcactgg gacatctaga gcctattgtt 
tgaggaatgc agtcttgcaa gcctgctctg gatcaagcca cagactgaaa cacccccgaa 180 
gagcaagcac gtttcttgga gcaggctaag tgtgagtgtc atatcttcaa tgggatgaag 2 40 
cgggtgcagt acctgaacag atacatccat aaacgggagg agaacctgcg cttcgacagc 3 00 
aacgtggagg agttccaggc agttacggaa ctggggcggc ctgtcgcaga gaactggaac 3 60 
agccagaagg gcatcccgga ggagaagcgg gacaagatgg acgactactg cagatacaat 42 0 
tacggggttt tttgagagct tcacagtgca gccgcgagtc catcctaagg tgactgfcgta 
tcctgcaaag acccagcccc tgcatcaccg caaccccctg gtcggctctg tgagtggttt 
ctatccaggc agcattaaag tcaggtggtt ccagaatggt caggaagaga aggctgcggt 
ggtctccata ggcctgatcc agaatggaga ttggaccttc cagaccctgg tgatgctgga 
aacagttcct cggagtggag aggtttacac ctgccaagtg gagcatccaa gcgtgacgag 
ccctctcaca gtggaatgga gtacacggac tgaatctgca cagagcaaga tgctgagtgg 780 
agtcgggggc tttgtgctgg gcctgctctt ccttgggaca gggctgttca tctacttcag 840 
gaatcagaaa ggacactctg gacttcagcc aacaggactc ctgcgctgga ctcctgagct 900 
gaagtgcaca tgaccacatt caaggaagaa ccttctgcca cagctttgca ggatgaaaag 960 
ctttcccact tggctcttat tcttccacaa gagctctctc aggacca 1007 

<210> 4 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> AF005487 



60 



<400> 4 

tttgcaggat gaaaagcttt cccacttggc tcttattctt ccacaagagc tctctcagga 

<210> 5 

<211> 3200 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> AF026941 

<400> 5 an 
caggaagggc catgaagatt aataaagatt tggactcagg gcaaatattt acttagtagc bu 
aataactcaa agaattactg ttgaataaat aagccaatta agcagccaat cacgtactat 120 
gcggatgcac acaaatgaaa ccctcacttc aacctgaaga cattcgcaca tgagttacgt «n 
agagggacct gcaggaagcg gtagagaaaa cataaggctt atgcgtttaa tttccacacc 
aatttcagga tctttgtcac tgacagcagc actaagactt gttaacttta tatagttaag 
aagaacaagg ctgagcgcga tgactcacgc ctgtaagcct agaactttgg gaggccaaag 360 
caggcagact gcttgagccc aggagttcca gaccagcctg ggcaacatgg caacacccca 42 0 



180 
240 
300 
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tctctacaaa aaaatacaag aatcagctgg gcgtggtgat gtgttcctgt aatctcagct 480 

actcgggagg cagaggcagg aggattgctt gaacccggga ggcagaggtt gtagttagcc 540 

gagatctcgc cactgcactc cagtctggac gacagagtga gactcagtct caaataaata 600 

aataaataca taaatataag gaaaaaaata aagctgcttt ctcctcttcc tcctctttgg 660 

tctcatctgg ctctgctcca ggcatctgcc acaatgtggg tgcttacacc tgctgctttt 72 0 

gctgggaagt tcttgagtgt gttcaggcaa cctctgagct ctctgtggag gagcctggtc 7 80 

ccgctgttct gctggctgag ggcaaccttc tggctgctag ctaccaagag gagaaagcag 840 

cagctggtcc tgagagggcc agatgagacc aaagaggagg aagaggaccc tcctctgccc 900 

accaccccaa ccagcgtcaa ctatcacttc actcgccagt gcaactacaa atgcggcttc 960 

tgtttccaca cagccaaaac atcctttgtg ctgccccttg aggaagcaaa gagaggattg 1020 

cttttgctta aggaagctgg tatggagaag atcaactttt caggtggaga gccatttctt 1080 

caagaccggg gagaatacct gggcaagttg gtgaggttct gcaaagtaga gttgcggctg 1140 

cccagcgtga gcatcgtgag caatggaagc ctgatccggg agaggtggtt ccagaattat 12 00 

ggtgagtatt tggacattct cgctatctcc tgtgacagct ttgacgagga agtcaatgtc 1260 

cttattggcc gtggccaagg aaagaagaac catgtggaaa accttcaaaa gctgaggagg 1320 

tggtgtaggg attatagaat ccctttcaag ataaattctg tcattaatcg tttcaacgtg 13 80 

gaagaggaca tgacggaaca gatcaaagca ctaaaccctg tccgctggaa agtgttccag 1440 

tgcctcttaa ttgaaggtga gaattgtgga gaagatgctc taagagaagc agaaagattt 1500 

gttattggtg atgaagaatt tgaaagattc ttggagcgcc acaaagaagt gtcctgcttg 1560 

gtgcctgaat ctaaccagaa gatgaaagac tcctacctta ttctggatga atatatgcgc 162 0 

tttctgaact gtagaaaggg acggaaggac ccttccaagt ccatcctgga tgttggtgta 1680 

gaagaagcta taaaattcag tggatttgat gaaaagatgt ttctgaagcg aggaggaaaa 1740 

tacatatgga gtaaggctga tctgaagctg gattggtaga gcggaaagtg gaacgagact 1800 

tcaacacacc agtgggaaaa ctcctagagt aactgccatt gtctgcaata ctatcccgtt 186 0 
ggtatttccc agtggctgaa aacctgattt tctgctgcac gtggcatctg attacctgtg 
gtcactgaac acacgaataa cttggatagc aaatcctgag acaatggaaa accattaact 
ttacttcatt ggcttataac cttgttgtta ttgaaacagc acttctgttt ttgagtttgt 
tttagctaaa aagaaggaat acacacagga ataatgaccc caaaaatgct tagataaggc 

ccctatacac aggacctgac atttagctca atgatgcgtt tgtaagaaat aagctctagt 216 0 

gatatctgtg ggggcaatat ttaatttgga tttgattttt taaaacaatg tttactgcga 222 0 

tttctatatt tccattttga aactatttct tgttccaggt ttgttcattt gacagagtca 22 80 

gtattttttg ccaaatatcc agataaccag ttttcacatc tgagacatta caaagtatct 2340 

gcctcaatta tttctgctgg ttataatgct tttttttttt tttgctttta tgccattgca 2400 

gtcttgtact ttttactgtg atgtacagaa atagtcaaca gatgtttcca agaacatatg 2460 

atatgataat cctaccaatt ttcaagaagt ctctagaaag agataacaca tggaaagacg 2 52 0 

gcgtggtgca gcccagccca cggtgcctgt tccatgaatg ctggctacct atgtgtgtgg 2 580 

tacctgttgt gtccctttct cttcaaagat ccctgagcaa aacaaagata cgctttccat 2 640 

ttgatgatgg agttgacatg gaggcagtgc ttgcattgct ttgttcgcct atcatctggc 2 7 00 

cacatgaggc tgtcaagcaa aagaatagga gtgtagttga gtagctggtt ggccctacat 27 60 

ttctgagaag tgacgttaca ctgggttggc ataagatatc ctaaaatcac gctggaacct 2 820 

tgggcaagga agaatgtgag caagagtaga gagagtgcct ggatttcatg tcagtgaagc 2 880 

catgtcacca tatcatattt ttgaatgaac tctgagtcag ttgaaatagg gtaccatcta 2 940 

ggtcagttta agaagagtca gctcagagaa agcaagcata agggaaaatg tcacgtaaac 3 0 00 

tagatcaggg aacaaaatcc tctccttgtg gaaatatccc atgcagtttg ttgatacaac 3 060 

ttagtatctt attgcctaaa aaaaaatttc ttatcattgt ttcaaaaaag caaaatcatg 3120 

gaaaattttt gttgtccagg caaataaaag gtcattttaa tttaaaaaaa aaaaaaaaaa 3180 
aaaaaaaaaa aaaaaggcca 3 200 

<210> 6 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> AF026941 



1920 
1980 
2040 
2100 



<400> 6 

atttttgaat gaactctgag tcagttgaaa tagggtacca tctaggtcag tttaagaaga 

<210> 7 
<211> 1799 



60 
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<212> DNA 
<213> Homo sapiens 

<300> 

<308> AF035284 
<400> 7 

gcttgaaccg gggaggtgga ggttgcagtg agctgagatc acgccattgt actccagcct 
gggcgacaga gcaagactcc atttcaaaaa aaaaaaaaaa aaaaaaaatc cactcatata 
aaaggtgagc tcagctcact ggtccatttc tcagtggctt ctccatcctc atttgcaaac 
ctcagaggga taaggcagtt gaacctgatg agcaagaatt ataacagcaa ggaaacatta 
atgcttagaa ttctgagatc cagcacaact cagtctgtgg gagctcagct cgctgcccag 
ggataggtat gacctatgtc tgccttaggc tgctgggaga tgccattctc cagtttcaga 
agcaggcagg gcaaaggtca agactgtggt attggggtct tttggctctg aaggatcctg 
gaaccactga ttttggttta ttccctccag ggtctaaaga gaacaagagg tgctagctct 
taccaaaaca gatggtagag agagttgctg gctatttaaa aagctctttc atcttttaat 
tcacctcttc ttttcacctc tttaaccact cctcaggaac agaacacttc taggactggg 
ggtcttttag ctccataagc aagtgagcag atgggacaag ttagtctttt ctccctagaa 
acaaagggga tgcccagtgg tttccctttg cttcccaacc taaaatttca agtttaataa 
aatagcaatt agcagaagtg accaaattgg gagataatta tcagtcatga ggaaagacac 
agatttcggt cataaagaat gtaagggcta taagtagaaa ctttctataa cctaaatgat 
gttatagaat tatttttgag caggagcaga aagattaaat atgatcactt catacttcta 
aatcagaaat aggaagatta aaaccacaga acagtttgtg atttctattg ctggtagcta 
ggtatcttac tctgtccact cttgttcaag tatctaactc ttctggaaac caaataggct 
ttagaagaga ttatcctata ttcctatcag tataatacta aaatgtaact ttttaatcat 
ctggttttta aaagataaac agtttagccc atctctccag agagcaaaca taggaatatg 
actcaggagc ctcctagggc ttatcatcag ccctcacacc cgcttccccc tccaacccac 
agcctttgct tccaggtggc aggattacta ctttgcctct tcagcagcat ctactctagg 
catattgatc attttagaca ctgggagaag agaacctcaa actaggagga aaagacagag 
cctccactta gttttgggag gggatggcag acagtcaagg agatgagcgt cctaaggcat 
gttgggatag ggtcagatgc accacccatg gagaggtttg tcaacacaaa gacatggaag 
gttagaggtt tgtcaacaaa aagacatgga aggttaggtt tgtcaacaca aagacatgga 
agattagagg tttgtcaaca caaagataca ggaagaatgg gctgcagaag atttagatgt 
tttccatttg ggcacatttt acttagctgg agaactaggt ttaaaacagc ctgggtagga 
aaattagaag caagctggat gcagtggctc atgcctgtaa tcccaacact tttgggaggt 
ccaggcagga ggatcacttg ggcccaggag gtcaagcctg cagcgagctg agatcacacc 
actgcactcc agcctggggt gatagaacaa gaccctgtct caaaaaaaaa aaaaaaaaa 

<210> 8 
<211> 60 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> AF035284 
<400> 8 

caaaaagaca tggaaggtta ggtttgtcaa cacaaagaca tggaagatta gaggtttgtc 6 0 

<210> 9 

<211> 1380 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> AF052162 
• <400> 9 

gtcaaaggat atttatttat aggccttttt ttttttaata tagaatctga ggctgtttgg 60 
gctttgactt aaatttccat caggcctctc tccagcaggt aatccctctc cttccgctgg 12 
gtcccctggg gaggtgtgaa ctcaagggcc tagccccaaa acactttttc tgcttttctt 18 
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aatccttttc cagtcccctc tttttttata aacgttggca gtttgatgtt tctgtttcgg 240 

cataacgtaa tccatttcac tgtagcctaa actccagtcc gaggttggat attgttcaaa 3 00 

tgagcagggc ccgagctgga agcgcaaggc agccgccgcc gtgccgctcc tcccttgccc 3 60 

tcaggccagg tccctgctgg aagcggctgc atcttcctgt cagccctggt ttccatggtg 420 

actggcgtca cgcagccacc cgagtatggc tgaccttcct gcagagagag gagccgcagt 480 

cttttgcttg tggaaggaga cgctgggctg tgcggtgcgg agggtgatga ggatgtctgg 540 

tgacagccgt gcggacacca ctcctctctg cagcactgcc tcccagcgcc agggtcgcgg 600 

gcacatccca ctgagagcgg gggtcctgcc ccatcttaga gtcaaaggca gaggggcttc 66 0 

caggccctgg atggggtatt ttggtgtcac ctgaagtccc tctgacatca ccttgtttca 72 0 

tcatttttta tgacagaatt agaaacccat ccttcaagca caataatcat cacagacttg 780 

agtttgcttc ctaaagcaaa ggctccgggt ttgtttggaa aatttttttg atttctgaaa 84 0 

tgaattgatt tttatatttg gggcatctct atagaaagtg accaccaagg ccagtaagta 900 

cgggaaaaaa tgtttactaa cttcctcaga gattcgtgat acgcgtttct ccactgacag 96 0 

acatttaaaa acaaccttca gctccgtttc aatcaatcac ctcgacttgt tttttagcat 1020 

ggacactgcc agcaggacag acagggatgg agtaaaccga agtcaatttc agggctcttg 1080 

gcgtgttgga cacagaagaa atcctagtgc agcctttggt agctaacagt cactgatttt 1140 

ataattggag aatgcgtaaa gattcatttt tcaaggagaa gagcctgcaa atggccaatg 12 00 

aaggaggtaa ataaactaag atattccgag ggaagggacc caggccacct cccttccgca 12 60 

ggtctgcaga tgaagggttt tttgaatgaa atgccactgt gcattttcag aaaaaaaaat 13 2 0 

ctctgataaa cagactttga atggaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 13 80 



<210> 10 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> AF052162 



<400> 10 

cagtaagtac gggaaaaaat gtttactaac ttcctcagag attcgtgata cgcgtttctc 



60 



<210> 11 
<211> 1722 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> AF055033 
<400> 11 

ggggaaaaga gctaggaaag agctgcaaag cagtgtgggc tttttccctt tttttgctcc 
ttttcattac ccctcctccg ttttcaccct tctccggact tcgcgtagaa cctgcgaatt 12 0 
tcgaagagga ggtggcaaag tgggagaaaa gaggtgttag ^gtttggggt ttttttgttt 180 

300 
360 



60 



o^yaaya-yya. ^ ^ ^ ^^-^^^J ^-ass — =s ^ — 

ttgtttttgt tttttaattt cttgatttca acattttctc ccaccctctc ggctgcagcc 

aacgcctctt acctgttctg cggcgccgcg caccgctggc agctgagggt tagaaagcgg 3 00 

ggtgtatttt agattttaag caaaaatttt aaagataaat ccatttttct ctcccacccc 360 

caacgccatc tccactgcat ccgatctcat tatttcggtg gttgcttggg ggtgaacaat 42 0 

tttgtggctt tttttcccct ataattctga cccgctcagg cttgagggtt tctccggcct 480 

ccgctcactg cgtgcacctg gcgctgccct gcttccccca acctgttgca aggctttaat 54 0 

tcttgcaact gggacctgct cgcaggcacc ccagccctcc acctctctct acatttttgc 60 0 

aagtgtctgg gggagggcac ctgctctacc tgccagaaat tttaaaacaa aaacaaaaac 6 60 

aaaaaaatct ccgggggccc tcttggcccc tttatccctg cactctcgct ctcctgcccc 72 0 

accccgaggt aaagggggcg actaagagaa gatggtgttg ctcaccgcgg tcctcctgct 7 80 

gctggccgcc tatgcggggc cggcccagag cctgggctcc ttcgtgcact gcgagccctg 840 

cgacgagaaa gccctctcca tgtgcccccc cagccccctg ggctgcgagc tggtcaagga 900 

gccgggctgc ggctgctgca tgacctgcgc cctggccgag gggcagtcgt gcggcgtcta 960 

caccgagcgc tgcgcccagg ggctgcgctg cctcccccgg caggacgagg agaagccgct 102 0 

gcacgccctg ctgcacggcc gcggggtttg cctcaacgaa aagagctacc gcgagcaagt 1080 

caagatcgag agagactccc gtgagcacga ggagcccacc acctctgaga tggccgagga 1140 
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gacctactcc cccaagatct tccggcccaa acacacccgc atctccgagc tgaaggctga 12 0 0 

agcagtgaag aaggaccgca gaaagaagct gacccagtcc aagtttgtcg ggggagccga 12 6 0 

gaacactgcc cacccccgga tcatctctgc acctgagatg agacaggagt ctgagcaggg 13 2 0 

cccctgccgc agacacatgg aggcttccct gcaggagctc aaagccagcc cacgcatggt 13 80 

gccccgtgct gtgtacctgc ccaattgtga ccgcaaagga ttctacaaga gaaagcagtg 1440 

caaaccttcc cgtggccgca agcgtggcat ctgctggtgc gtggacaagt acgggatgaa 1500 

gctgccaggc atggagtacg ttgacgggga ctttcagtgc cacaccttcg acagcagcaa 1560 

cgttgagtga tgcgtccccc cccaaccttt ccctcacccc ctcccacccc cagccccgac 1620 

tccagccagc gcctccctcc accccaggac gccactcatt tcatctcatt taagggaaaa 1680 
atatatatct atctatttga ggaaaaaaaa aaaaaaaaaa aa 1722 



<210> 12 
<211> 60 
<212> DNA 
<213> Homo 



sapiens 



<300> 

<308> AF055033 
<400> 12 

tccaccccag gacgccactc atttcatctc atttaaggga aaaatatata tctatctatt 60 



<210> 13 
<211> 1411 
<212> DNA 

<213> Homo sapiens 



<300> 

<308> AK001166 



<400> 13 

aaacaaagag 

attttcccgt 

tagattggta 

gcagacctct 

agctgatatg 

atttgaatat 

cataacagat 

atcctatcct 

tcctgaaaaa 

accatttcaa 

gagaccactg 

aaagaaaagc 

aatgtttggc 

taaaatttafc 

ttaagggtat 

atacatccta 

atagcgtgta 

agtagccaat 

ttaatgtctt 

gattttacta 

gagtagattt 



atgccacccc 
tgcatcttgt 
acgtttctga 
atagaggagc 
gatatcactt 
caaagatcat 
gccaaactct 
gaagtctatc 
cccaaaccga 
agaactagaa 
ttgttgtttt 
ataaaacaaa 
cattagtatt 
gagtcaaata 
ttttatctct 
gctactgata 
tacttgaata 
tatcatttct 
atattgtgat 
aagctagtgt 
tcatctgcct 



tgtgtgatgg 
gttccaagga 
tggacaatta 
gtgtggctca 
tatctgctcc 
atggctctca 
ccaacaaaga 
aagaacgatt 
aaccacagct 
gttttcgaat 
gagtgaatgg 
gactactgaa 
tttataaaac 
tatagtgata 
gtgctttctt 
tctttattat 
agaaagacac 
cctgtataaa 
agctatgtct 
ttttataaca 
gcaatggtat 



ctttggtacc 

tgaagtggac 

ccaggaaatt 

tctacgaaga 

atcattttgc 

ggaacctctg 

gaaaaagaag 

tcctacacca 

gctaatgtgg 

gtaataatac 

tggttaggag 

atatagataa 

tcaatgctag 

atgttaacat 

tttcatggtg 

agccttaaga 

tgggtactgt 

ttccagtttt 

tttattgcag 

tatatattag 

aatttcagtc 



cgaacactga 


tggttcagac 


60 


ttggatgagt 


tattagctgc 


120 


ctgaaagtcc 


ctttggcctt 


180 


gtccagataa 


aatacccagg 


240 


cgtcaaatta 


gtccagagga 


300 


gcagccttgt 


tggaggaagt 


360 


aaactgaagc 


agtttcagaa 


420 


gaaagtgcag 


cacttctgtt 


480 


gcactaaaga 


agcctttcca 


540 


ttccacagca 


acaggtgcta 


600 


aaagactttg 


gtggtggaag 


660 


agattgcctt 


agtttttaaa 


720 


ttttaagtgt 


ataaattggt 


780 


gtttgtaatt 


gctacagaat 


840 


tttattaaat 


aattgtgtat 


900 


cttaatttta 


agtcttaaaa 


960 


tactgtgatg 


ctattgactt 


1020 


tattgctgca 


cataaatttt 


1080 


atttattgga 


tgttatgaca 


1140 


ttgatgttta 


cctataagtg 


1200 


ttagctaaaa 


atggaaagtt 


1260 


ctaagtcaaa 


tgcaaatggg 


1320 


atccttttga 


aagtaagtga 


1380 



aatgtaaata aacttatttt ttttaaaaat g 1411 



<210> 14 
<211> 60 
<212> DNA 
<213> Homo 



sapiens 
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<300> 

<308> AK001166 



<400> 14 

acccttagac ctctgattct aagtcaaatg caaatgggtt aaataaaatg agactacttc 60 

<210> 15 

<211> 2352 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> AL049367 



<400> 15 

ggcaaacccc ttttaaaatc taatgtctgg gctttgagta ttagctcatt tagggtggac 60 

aaatgcatta ctgttttcaa actgctcaca tttattcagt atttctccaa gttgctatct 12 0 

actcagcctt atgaatgccc ctcgcttttc taaggccatg tgaaaatcac ggcactgccc 180 

ttagccttgt gtcatctgct ttttcgttct gcgatatgcc cagttcccaa atcaattata 240 

ggtacctgtt taggagagag gaagatttta cctctcaaag ggtgagattt gaaatttaca 300 

ctaaaaagac aactttacat ttaatgcttc acttaatgag acattctttt ttttataagt 360 

ctatttttct actcagtttc agaacactaa tctgattttc actctgattt ttaacgtttc 42 0 

tttaaatatt tataatgtag cttctttcaa aatattttca tgaaaaatta cttttattat 480 

accattatgt gcatgttatt ggtagcaggc atagtttatt atttagtact gaaacatgct 540 

cttttaccta acagtaaaca agtatgtttt gatatatatc tgttaatatg cttatagtgg 600 

taagaaatgg acttgaggtc ccaggagatt tcattttatt caccctggtc agatacaata 660 

aaggctatga gtataaatac ataacttcct aaccaggtgt agggcatgtt catgaatatc 720 

aaatcttttg atgctggacc caagagagga aaagttgtag ctaaatgttg atttacttat 7 80 

aactagacgt ctatgtgaga aaatatatgt atacatatat atgatatgca gaagtcactt 840 

tttttatcag gctttattct ccttacaaag ccacagttta actgtctgca acagttggtt 9 00 

tatgttaatg atagacaaat acccagtgtt tgttactttt tccaactacc actgtaatga 9 60 

taatctttct cacgtatata catgcaactt cttggcttca tttccatgaa gctgtttcaa 102 0 

tatattcagt atactttgtc cttaatgctg cttctgttaa cagtgatctc tttctttttt 1080 

tcattcttat atcttcatta gttcatcata aatctgtcca gttgaggcct caggaccacg 1140 

gcatgatttc atgactccga agtattttac agaaacattt tttaaataag ggaaatattt 12 00 

tatataccag atggttcaca agtgatggct catagctagt tttttttttt tcttctaaaa 12 60 

aatgtcaggt ttttaaaatc atttacctta ttaaaatgaa aagtgccata cttaactttt 1320 

aaaggaaaga cctgacttgc tttttctcta tttagactgt ttttgtactt tactaatctt 13 80 

taaactatca ggaaaaaaac caaaacttta taccaatgat ttagtaattt tgaggcatag 1440 

ggtagcttac gtagtggagg atgtgccaaa tattctcttc aaatgccacc ttctcaattt 15 00 

ataactaaaa tagtgttatc tgactaattc ctctgaattt tgatgtaaga tctatatagg 1560 

cccccaaaat gatcgtagta catgccagtc atttctcagt gaaataaata caataccaga 162 0 

gtacattatg ggttttattg ctttctttta tggtagacct gttaatgggg aaaaaataca 1680 

tcaaatcaaa tagaatctta tatctgtatg ttaaaataga gcacttacct gaagtcagtg 1740 

gcctggatca tagccctgga tcatttccca gtctgtcctg tgctgtgtga ccttggacaa 1800 

ggcgcttcat ctctctgggc ctctatttct ccatttgtaa aacaagtggc tgcagtagat 1860 

gatggctgag agcccttcct gttcccagat gccttggtcc aaagacccca cccctctgct 1920 

ggtcctgcca acgtgttggt gctataagct gcttcagata taaaattggt ttatctataa 1980 

tgtttgttca tttaatagct tctaaaaggc ctttttgtta tacagtgctt tttttctagt 2 040 

tttatggact tgattactgt aataatgtct tgtttttagc catgtaacta caaacagata 210 0 

ttctcttgat gtcttagtaa atttgcattt gatatatcat tgatgagatt ttgttgttat 2160 

gtaatattct ttggctacgc atctgtccag catcttatta accataatac tgtgatcatt 222 0 

atttggaaat atgtcctatg gaaagaataa aagcatgtac ttcacagcta gcatgttcac 2280 

agatttgaaa gaagtttcat taaaagcacc attgctttct gtaaaaaaaa aaaaaaaaaa 2340 
aaaaaaaaaa aa 23 52 



<210> 16 
<211> 60 
<212> DNA 
<213> Homo 



sapiens 
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<300> 

<308> AL049367 
<400> 16 

atttggaaat atgtcctatg gaaagaataa aagcatgtac ttcacagcta gcatgttcac 60 

<210> 17 
<211> 1130 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> AL080235 
<400> 17 

ggtcgccgca ccggccgcct ccggcccgcc gccgccccca gcgccgccgc cgccaccgcc 
ggggcgccca ccgcgctgcc agcctacccc gcggccgagc cgcccgggcc gctgtggctg 
cagggcgagc cgctgcattt ctgctgccta gacttcagcc tggaggagct gcagggcgag 
ccgggctggc ggctgaaccg taagcccatt gagtccacgc tggtggcctg cttcatgacc 
ctggtcatcg tggtgtggag cgtggccgcc ctcatctggc cggtgcccat catcgccggc 
ttcctgccca acggcatgga acagcgccgg accaccgcca gcaccaccgc agccaccccc 
gccgcagtgc ccgcagggac caccgcagcc gccgccgccg ccgccgctgc cgccgccgcc 
gcggccgtca cttcgggggt ggcgaccaag tgacccgctc cgctcctccc tgtgtccgtc 
ctgtgtccgc gcgcgcgggt gcctttcccg ccggggactc ggccggtgtg cttcgtgctg 
tagttatcgt tagttcctct tcccgagatg gggccgccga gagaccccag cgcctttgaa 
aagcaaggtt tgtgctgcgc ttccagttcc gaaaagcaga tgtttaagcc cttggactga 
gggtgggatc gcagctccga agacggagag gagggaaatg gggccctttc ccctctattg 
catccccctg cccgactcct tccccgcacc cacgtgccct agattcatgg cagaaaatga 
ccaaatcctg tgtatttgtt ttatatattt aataactgtt ttaaatgaaa gttttagtaa 
aaaaaataca aaacaaaaag attaaattgc tattgctgta gtaagagaag ctctttgtat 
ctgaacatag ttgtatttga aatttgtggt tttttaattt atttaaaatt ggggggaggg 
catgggaagg atttaacacc gatatattgt taccgctgaa aatgaacttt atgaaccttt 
tccaagttga tctatccagt gacgtggcct ggtgggcgtt tcttcttgta cttatgtggt 
tttttggctt ttaatacaga cattttcctc caaaaaaaaa aaaaaaaagg 113 0 

<210> 18 
<211> 60 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> AL080235 
<400> 18 

ctttgaaaag caaggtttgt gctgcgcttc cagttccgaa aagcagatgt ttaagccctt 

<210> 19 
<211> 2498 
<212> DNA 
< 2 1 3 > Homo sapiens 

<300> 

<308> AL137540 
<400> 19 

gctgaaacga cagtcttgtc cctgtcagag aaatgacctg 
tacacactat gcaatctatg atttcattgt caagggcagc 
tgatcaatgc atacctgttc atggcttcag acctgtcaag 
ggtccatggg aagtgtatgt gtaagcacaa cacagcaggc 
cccgttatac aatgaccggc catgggaggc agctgatggc 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 



aacgaagagc ctcaacattt 60 

tgcttctgca atggccacgc 120 

gccccaggaa cattccacat 180 

agccactgcc agcactgtgc 240 

aaaacggggg ctcccaacga 300 
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gtgcagaacc tgcaagtgta atgggcatgc tgatacctgt cacttcgacg ttaatgtgtg 
ggaggcatca gggaatcgta gtggtggtgt ctgtgatgac tgtcagcaca acacagaagg 
acagtattgc cagaggtgca agccaggctt ctatcgtgac ctgcggagac ccttctcagc 
tccagatgct tgcaaaccgt gttcctgcca tccagtagga tcagctgtcc ttcctgccaa 
ctcagtgacc ttctgcgacc ccagcaatgg tgactgccct tgcaagcctg gggtggcagg 
gcgacgttgt gacaggtgca tggtgggata ctggggcttc ggagactatg gctgtcgacc 
atgtgactgt gcggggagct gtgaccctat caccggagac tgcatcagca gccacacaga 
catagactgg tatcatgaag ttcctgactt ccgtcccgtg cacaataaga gcgaaccagc 
ctgggagtgg gaggatgcgc aggggttttc tgcacttcta cactcaggta aatgcgaatg 
taaggaacag acattaggaa atgccaaggc attctgtgga atgaaatatt catatgtgct 
aaaaataaag attttatcag ctcatgataa aggtactcat gttgaggtca atgtgaagat 
taaaaaggtc ttaaaatcta ccaaactgaa gattttccga ggaaagcgaa cattatatcc 
agaatcatgg acggacagag gatgcacttg tccaatcctc aatcctggtt tggaatacct 
tgtagcagga catgaggata taagaacagg caaactaatt gtgaatatga aaagctttgt 
ccagcactgg aaaccttctc ttggaagaaa agtcatggat attttaaaaa gagagtgcaa 
gtagcattaa gatggatagc acataatggc acttgtctat gtacaaaaca caaactttag 
agcaagaaga cctcagacag gaaactggaa ttttttaaag tgccaaaaca tatagaaatg 
tttgaatgca tgggtcttat ctaacttatc tcttctggac ccatgtttaa atacagtttt 
atttcatgaa gagaaatgaa aacccctaca ctgatatctg ttttctatgg gactgattct 
gaaattctta actattaaga atattttaat agcagcatga catttagcag taatccatta 
agggcagtac ctctaacaag gacgccttcc agcttcagcg atgttactta cgtttgatgc 
tacttaaagt aatgaatgac gttttaagga atccctaacc ctactatcag aaaaggtgtt 
tgttaaagag ccttctcttg tgtgttacgc atgaactttg gtctgtaggt gttaaatgga 
acctctccat gtgtatatag tatttccttg tataaagcac tttactacct accacttgtg 
ttgtgaacgt ttggtgactg ctgttgaaag aaggaaaagg gtgtgtgaga aagcctactg 
aagcagcagc actgccacta catgtggaca aaagtgacca tataaaagaa gttgtgctat 
ttaactctga atacttggag aaactaggtg aagatgcaac cagaaaggag aatatgtatg 
cgtgaagtct cagctttgag ctggaggcta gattccaaga tgacagccat gatgaaactt 
tttaaaaaac taaaccagaa gagactttaa aataagagaa agaaatcata aatgtagaca 
tatgcttggc taaaggggaa atggacttta aattttaaag agctcatttg caatgcactt 
gtatacactt caaaaattat tgtagacaca gaatttgtta tatttttgtg cttagtattt 
aaacctgaac attgaaacag ttttcctcct tgtctttctt aacagtaata gtcattatat 
ttacctgttt tttaacacaa tgtatgtgat agtcaaaaaa tcacagtttt tcattattat 
tcatcttctg tacccacgca taaccactat acatagtttc ttttgtactt gaatatacaa 
aacatgaaca cagtgccata tgaataattt cacatacaga accttttttt ctctgaagtc 
ctgtggactt gcaaatatat atatatattg ctttgttaat ttgtttttat atttcatata 
tgtaataaag gaatatgatc tgaaaaaaaa aaaaaaaa 249 8 

<210> 20 
<211> 60 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> AL137540 
<400> 20 

tggaggctag attccaagat gacagccatg atgaaacttt ttaaaaaact aaaccagaag 

<210> 21 
<211> 914 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> AL160131 
<400> 21 

cgcaccgcag gagcaacggt tggtcctgcg gctgtgatgt cggtgttgag gcccctggac 60 
aagctgcccg gcctgaacac ggccaccatc tfcgctggtgg gcacggagga tgctcttctg 120 
cagcagctgg cggactcgat gctcaaagag gactgcgcct ccgagctgaa ggtccacttg 180 



360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
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gcaaagtccc 
gtggttaatc 
gatgccagct 
cactgcagca 
ctctactgtg 
gtgctgcaga 
ctgagaagct 
ccccttctca 
ggcttgacag 
tgcagcaggg 
ctcccactcc 
ctccatttaa 
ggtctcattc 



tccctttgcc 
ttcacagcaa 
tcttcttggg 
ttcaccggca 
acctggaggt 
tctgtgctgg 
ctgagggccc 
tggcttcgtg 
gcccaggctc 
agagctgggc 
cacactcact 
ggtfctccttt 
acca 914 



ctccagtgtg 
atacagtctc 
gaaggtgtgt 
caccgtggtg 
ggaaggcttt 
ccacgtgccc 
ctccctggag 
ctgactccat 
agcccgccgt 
agaagcagcg 
cttgcagagc 
gaacaaaagg 



<210> 22 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> AL160131 



aatcggcccc 
cagaacacag 
ttcctcgcca 
aagctggccc 
agggccacca 
ggtgtctcag 
gacctgtgag 
aaacattctc 
ggctgggaag 
agggggccca 
ctgtgtcttt 
tctgtggcta 



gaattgacct 
aggagtccct 
caggtgctgg 
acacctatca 
tggcgcagcg 
ctctgaacct 
ggtggctggc 
tgttgaggat 
gttccctgca 
gctggcgaga 
aagcagctgg 
aaaaaagttt 



gatcgtgttt 
gcgccatgtg 
gcgggagagc 
aagccccctg 
cctggtgcgc 
gctgtccctg 
ccctgggctg 
gtccagtcag 
gtgccagtgc 
ctgtagcccc 
cgtgttacat 
aaaaatcact 



240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 



<400> 22 

agctggcgtg ttacatctcc atttaaggtt tcctttgaac aaaaggtctg tggctaaaaa 



60 



<210> 23 

<211> 4753 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> D13642 



<400> 23 

cttcaatcaa 

gagtaacaac 

tcagagaaag 

gctggccgca 

agctcccaag 

agggaacaca 

gtgcaggttt 

gatactaaac 

tggggaaaaa 

cccattccag 

aaagaagaag 

ccagactatt 

ctacaagcgt 

cactacagcc 

atgtgtggtg 

caaagccctg 

gaactaccat 

ctcagaatca 

gtcccatgag 

ccctctctgt 

gattgatgga 

ctctgaagaa 

ccgctacgcc 

ttcccccacc 

attaagactg 

tggcttcccc 

catgacccca 



gtagccttcc 

cttattatca 

cagcagatgg 

gagatggcag 

gctggcaatg 

ctggaccttg 

tccaacactg 

ccccgatctg 

ctggagtttt 

gggagggtgt 

ttactccgaa 

ggacataggg 

aatgaaaacc 

agcctcctgg 

aggctcccac 

tgggaccgtg 

gtcggggaga 

cttgtcfcata 

gaccatgact 

gggcgggacc 

gacctctgtg 

ctggaccgaa 

ttctgagccc 

accatcactg 

cattatgaaa 

tcaaattggc 

ggttccagtg 



cactgcagta 

ttgaaacgga 

cagaggaaat 

cagcattcct 

ggcagtgggc 

tccagctgga 

gtgaagactg 

tggcaggggg 

tgcacaagac 

tgattggtgt 

aatgtgagaa 

taattgtatc 

agcttatcat 

actatgacac 

ctaacaccaa 

gcttgctcaa 

cggtgctgtc 

ccaccttgtc 

tcttccagca 

acctcagctt 

agcagttcaa 

ccccacccga 

tcctttcccg 

ccacctggct 

gtcaacagct 

actgagattt 

tagaacctga 



cacacccagg 

ccacaatgcc 

ggtggaagca 

caatgaaaac 

ctctgtgatc 

acagaatgag 

gtatgtgctg 

cttcgtctat 

tcctgtggaa 

ggggaagctg 

taagcatatt 

tgatgtccaa 

ctttgctgat 

tgtggctggg 

tgatgaagta 

tggggcctcc 

cttgcagaag 

tggaggaatt 

tgtggaaatg 

tcgctcctac 

ttccatggaa 

agtgtccaag 

gtggggcttg 

tctgccatgt 

ctttcccctc 

gctacacttc 

gtcccccatt 



aaatttgtca 

tacactgagg 

gcaggggagg 

ctccctgaat 

cgagtgatga 

gcagctttta 

gtgggtgtgg 

acttacaagc 

gaggtccctg 

ttgcgtgtct 

gccaattata 

gaaagtttca 

gatacctacc 

gcagacaagt 

gatgaggatc 

cagaaggcag 

accacgctga 

ggcatccttg 

cacctgcggt 

tacttccctg 

cccaacaaac 

aaactcgagg 

ccagagactg 

ggcaggaggg 

agctcttctc 

tccccacctg 

ccccaaagcc 



tccaccctga 

ccacgaaagc 

atgagcggga 

ccatctttgg 

atcccattca 

gtgtggctgt 

ccaaggacct 

ttgtgaacaa 

ctgctattgc 

atgacctggg 

tctctgggat 

tctgggttcg 

cccgatgggt 

ttggcaacat 

ctacaggaaa 

aggtgatcat 

tccctggagg 

tgccattcac 

ctgaacatcc 

tgaagaatgt 

aaaagaacgt 

atatccggac 

tgtgttttgt 

tgactggata 

ctggaatgac 

gtacatgata 

atccctgcat 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 
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tgatatgtct 
ttgtaaatac 
gactgggtag 
gaaacctagt 
gtgggaggtt 
accgacttgg 
gttttttcct 
cccacccctc 
ctgaggatgt 
cccttttact 
agtacagaag 
tgcagcaccc 
gtggaagagg 
ttgtgtggga 
tcactgttct 
ctaaattgag 
ctgaacctct 
taagtatctc 
aggaatcgaa 
gacagtagtt 
cctaaatcca 
tagatacttg 
tccgtacaga 
tgtcaccacc 
gaagggctag 
tgcaggacct 
tgtaattgtt 
agacctctct 
atctgccctt 
aggactgaaa 
agatgtttcc 
tagttggaat 
cgctggaaaa 
cgctgctgcc 
tctggctttc 
aaattttatt 
agctcactga 
agactacaga 
tccctgtgtt 
ctcccagcgc 
gcctgggcaa 
ggtgcacact 
gagtttgagg 
gaaacctgtc 
aacgcacatg 
cctgggagat 
ttccaagcca 
agtgagatcc 
ccaacggatg 
tgacctgtgt 
gggcattatt 
attgagaaaa 
cactattgtc 



tgactctcct 
agttttgtac 
ggtatagtat 
ttttaaggtg 
tgaacccaag 
acttgcggca 
ggttcttttt 
ttcccactgg 
tagtatgggg 
gatagagcac 
aacctagata 
atctcaggtg 
cccagttcca 
agccctcgtc 
caatacttca 
caaatgatct 
cagagcccag 
ttgaatcatg 
tccctaaata 
ctgaattaga 
ttctatcaaa 
agcagctcct 
aggtaggggg 
gagcttcctc 
gacaaggatt 
acaaaagttt 
aaggggacct 
ccagctttgg 
gggtttgctt 
agtttttagc 
tttggaagat 
gttgcttttc 
ggctcgtctc 
tttcttgagc 
aaattttttt 
agagacagtc 
ggcttgaact 
tgtgtgccac 
gcccaggctg 
tttgagaggc 
catgacaaaa 
tgtagtccca 
ctgcagtgag 
tcaaaaacac 
actgagtgct 
ctgggggacc 
agaggcccca 
tgtggctgtg 
aatcaacgcc 
ggccactgcc 
gtcctgttgc 
catttcatga 
act 4753 



gtctactttt 
aatgttatct 
cactcctgag 
actggcatcc 
ttagagcagg 
ctctgtggct 
gctgtgctga 
gtattgagga 
attttagcat 
ctaaagcgat 
ctgccctgcc 
ggttccagag 
gaaatgttcc 
ttccaggctg 
caaatgtaaa 
ggtgagcact 
aacagagggt 
acttggtttt 
acatgttttt 
ttttgaaaac 
ttgtgtgata 
cgcctctttt 
ttatgtaaga 
tggaagtctg 
tgggcttgaa 
aaaattagat 
gattgactcc 
caaaacccat 
ccatcctgtt 
tgcctcagct 
ctcttgccaa 
ttggttagtg 
acagggagag 
ttgttagagg 
taattcctaa 
tctctctctt 
cctgggctcg 
catactcagc 
gcctcgaact 
tgaggcagga 
ccccatctct 
gtaattaggg 
ctgtgattgc 
caaaaaacaa 
caggggttct 
tgctgtccta 
gagagggcac 
cccccatcac 
ttggcagagt 
ttggggacgg 
aagttcaggg 
acaactactt 



gcacacaccc 
ctgtgggagg 
ttccactgct 
atgtgtcttg 
aagaactgag 
ccccaccccc 
tgaaacatga 
agggtggctg 
gaattccagc 
ctttggctcc 
cctgagaact 
ggcctttagg 
agcccacccc 
tccttgcgcc 
actttctttc 
gggttagaat 
tcctgacact 
agatcagtca 
ttctcactta 
ctaatttcag 
ctgacatgca 
ctaactcaag 
aaggaaaacc 
cccatcagct 
tatgtggaaa 
tggatgtgac 
tgtggtttga 
cagaggctgc 
ccattgctaa 
ttcccctgac 
gaatagcatt 
taaatgtatt 
tgctggtccc 
aaagccagaa 
gttctgtttt 
gcctagctgg 
agcaatccac 
tagtttttaa 
cctgacctca 
ggatcccttg 
ccaaaaatac 
ggctgagaca 
gccactacac 
aaaccggtct 
gaggcttgtc 
tatgtgatgc 
cgtggggtgt 
ctccacccac 
thtccagcag 
gtgaggagtt 
caaaaccagg 
gtggcatgca 



ttaattttta 
aaggaggcag 
ctagaatcta 
ttctggagat 
tagactcctt 
aggtctgtgg 
cctcaataac 
attcttcctc 
tggggagtct 
ataggaccat 
gtgtatatgt 
gtataatgag 
tgagaattcc 
ttgaacctgg 
gtctgcatgt 
caggaatggt 
gtgacactgt 
agagagac cc 
gctcatgaat 
ggctcatttt 
gtcatctgag 
tttgactaaa 
taatctatgg 
tgcttgttct 
ggaattttca 
tcaatgacaa 
ttgagcaacc 
tgcagaactc 
gcccttgtga 
cttactggca 
cctttggagg 
gctagtgaga 
cagaatgtgt 
aggcattcag 
attttttaat 
gagtgcagtg 
ctcagcctcc 
actttcgtag 
aaaaatcttc 
agcccaggag 
aaaaattggc 
ggaggatcac 
tccagcctgg 
cctggggtca 
cgctgacctg 
tttgaaagaa 
tcaggcttct 
tctgccctcc 
ggccttgcag 
agcctggaac 
aatccagttt 
ttggcactcg 



attggttttc 
gctgtggtgg 
accagaaata 
gaggatgtag 
ccttccagat 
tggtttcttt 
catgtgtata 
ctcttctact 
taacagatgc 
aggaagggtc 
ggggcctgtc 
agcctgttag 
tcctgtttag 
agaagtgagc 
gctcagccat 
ggaatacaat 
ctcctggaac 
aggttttgcc 
ttgcatagta 
ttcctgtggc 
gaactcagcg 
atacatacac 
aatcaggagt 
ctgttaagag 
tagttgttgc 
gtcccatctg 
aggtaaatag 
agacagaggg 
cttggatcct 
gaggttctgc 
aggggggttc 
cagctgccgg 
gctgttccca 
atgggatcag 
tttttaaaaa 
gagtgatcat 
agagtagggg 
agacagggtc 
ctgccttggc 
tttgagacca 
caggcatggt 
ttcagcctat 
atgacaggac 
tggtagcaca 
gggctctggc 
aggggcatca 
gtgaggcccc 
cactagctgc 
agagtgtgtg 
attccagcgt 
tgtcgatcca 
gaataaagcg 



1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 

3900 

3960 

4020 

4080 

4140 

4200 

4260 

4320 

4380 

4440 

4500 

4560 

4620 

4680 

4740 



<210> 24 

<211> 60 

<212> DNA 

<213> Homo sapiens 



<300> 

<308> D13642 
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<400> 24 

aaaccaggaa tccagttttg tcgatccaat tgagaaaaca tttcatgaac aactacttgt 6 0 

<210> 25 

<211> 2591 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> D25328 



<400> 25 

cccggacgtg 

aagggctcct 

ctgaccagcg 

ggtatctacg 

ggaggctcaa 

gggacgatca 

gctgcttgca 

agcctcaccg 

aggaacggcc 

atggtgggct 

gccctgcaca 

aggaccttcg 

ttggcctgcg 

gagcagatgt 

attgtggctg 

gagcttgtcg 

agaggaggga 

gtcatcgcct 

aaccacgccg 

gcgatggacg 

aacctgaaca 

aattgcaacg 



cggctcccct 

tgcggaagtt 

gcggggatgc 

tgggggccaa 

acatcgcaga 

ttggcagtgc 

acctgctgca 

gggccaacct 

agatcgataa 

ccatcgacaa 

ggatcatcga 

ttctggaggt 

gtgcggactg 

gtgtcaaact 

aaggagcaat 

tcacgcagct 

ccccttcggc 

tgctagaggc 

tgcgcctgcc 

agaggagatt 

cctacaagcg 

tagctgtcat 



cggcctcctc 

cctggagcac 

tcaaggtatg 

ggtgtacttc 

ggccgactgg 

gcggtgccag 

gcgcggcatc 

cttccggaag 

ggaggccgtg 

tgatttctgc 

ggtcgtcgac 

gatgggacga 

ggtgttcctt 

ctcggagaac 

tgatacccaa 

gggctatgac 

attcgacagg 

caccccggac 

gctgatggag 

tcaagatgcg 

acttgccatc 

caacgtgggg 



gccatggacg 

ctctccgggg 

aacgctgccg 

atctacgagg 

gagagtgtct 

gccttccgca 

accaacctgt 

gagtggagtg 

cagaagtacg 

ggcaccgaca 

gccatcatga 

cactgtgggt 

ccagaatctc 

cgtgcccgga 

aataaaccca 

acacgtgtga 

atcttggcca 

accccagctt 

tgcgtgcaga 

gttcgactcc 

aagctgccgg 

gcacccgcgg 



cggacgactc 

ccggcaaggc 

tccgtgccgt 

gctaccaggg 

ccagcatcct 

cgcgggaagg 

gtgtgatcgg 

ggctgctgga 

cctacctcaa 

tgaccatcgg 

ccacggccca 

acctggccct 

caccagagga 

aaaaaaggct 

tcacctctga 

ccatcctcgg 

gccgcatggg 

gcgtcgtgtc 

tgactcagga 

gagggaggag 

atgatcagat 

ctgggatgaa 



ccgggccccc 

catcggcgtg 

ggtgcgcatg 

catggtggac 

gcaagtgggc 

ccgcctgaag 

cggggacggg 

ggagctggcc 

cgtggtgggc 

cacggactcc 

gagccaccag 

ggtgagtgcc 

aggctgggag 

gaatattatt 

gaaaatcaaa 

gcacgtgcag 

agtggaggca 

actgaacggg 

tgtgcagaag 

ctttgcgggc 

cccaaagacc 

cgcggccgta 



cgctcagctg 

tttgacggct 

accggccaag 

gagatcgcca 

gaggcctacc 

gtccccatgg 

atcggggcag 

gccagcggaa 

ctggccaaca 

ttcgacatca 

atccagagag 

atttaccagc 

ggtcacatgc 

tctgccagag 

aaatttacca 

caacctgtgg 

tggtggctca 

gtgtcggact 

tgcatgtgcc 

gcactttatg 

cagctccagt 

gaattaatta 



tgcgcgtggg 

tcgccaaggg 

gaggctccat 

cacagatgcg 

tgggactcct 

tcatggttcc 

acaccgccct 

ccaagcggcg 

tgggggggct 

gggatctgca 

gccttgtgct 

tgtattcaga 

agcagggtgg 

ctatggagtg 

ccgatgattc 

cagagctgaa 

agctacggcc 

caggccagct 

tgcagccacc 

cacgtattat 

gcgtgctgtc 

a 2591 



cattgccgac 

ccagatcaaa 

tcttgggaca 

cacgcacagc 

ggagctgtca 

cgctactgtg 

gaacactatc 

cgtgttcatc 

cgcggccgga 

gtccaacgtg 

cagaaatgag 

agagggcaaa 

ggcaccctct 

gatcactgca 

catttgtgtg 

gaagcaaacg 

cctcatgaaa 

ggaacatgtg 

gtggactgtc 

tgacattaat 

tgtggagtgt 



ggccacagga 

gaaatcggct 

aaacgcgttc 

atcaacgcgc 

gccgcccggg 

tccaacaatg 

accgacacct 

atcgagacca 

gctgatgccg 

gagcacctga 

agctgcagtg 

ggcgtgtttg 

ccatttgata 

aaactcaagg 

ctgggaataa 

gattttgagc 

atcctggcca 

cagccctgga 

tgtttttgta 

acctaatcgg 

gtctcatgct 



tgctcgccat 

ggacagatgt 

tcccggggaa 

tgctgatcat 

agaagcacga 

tgccgggttc 

gcgaccgcat 

tgggcggcta 

catacatttt 

cggagaaaat 

aaaactacac 

actgcaggaa 

gaaactttgg 

aggcccgggg 

gcaaaagaaa 

acaggattcc 

agtacaaggc 

gtgtctgacc 

acacttaagt 

cgagtgccca 

ttcagatgtg 



ctatgatggc 

cgggggctgg 

gtacttggaa 

cggtggattc 

ggagttctgt 

cgatttcagc 

caagcagtcc 

ctgtggctac 

cgaagagccc 

gaagaccacc 

caccgacttc 

gaacgtgctg 

aaccaaaatc 

cagaggaaaa 

cgttattttt 

caaagaacag 

cagctatgac 

cagtcccgcc 

tattttatca 

tctgccccac 

catatgagca 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 



<210> 26 
<211> 60 
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<212> DNA 

<213> Homo sapiens 
<300> 

<308> D25328 
<400> 26 

tattttatca gcactttatg cacgtattat tgacattaat acctaatcgg cgagtgccca 

<210> 27 
<211> 2573 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> D50402 



<400> 27 

gaatcggccg atgtgaaccg aatgttgatg taagaggcag ggcactcggc tgcggatggg 6 0 

taacagggcg tgggctggca cacttacttg caccagtgcc cagagagggg gtgcaggctg 120 

aggagctgcc cagagcaccg ctcacactcc cagagtacct gaagtcggca tttcaatgac 180 

aggtgacaag ggtccccaaa ggctaagcgg gtccagctat ggttccatct ccagcccgac 240 

cagcccgacc agcccagggc cacggcaagc acctcccaga gagacctacc tgagtgagaa 3 00 

gatccccatc ccagacacaa aaccgggcac cttcagcctg cggaagctat gggccttcac 3 60 

ggggcctggc ttcctcatga gcattgcttt cctggaccca ggaaacatcg agtcagatct 42 0 

tcaggctggc gccgtggcgg gattcaaact tctctgggtg ctgctctggg ccaccgtgtt 480 

gggcttgctc tgccagcgac tggctgcacg tctgggcgtg gtgacaggca aggacttggg 540 

cgaggtctgc catctctact accctaaggt gccccgcacc gtcctctggc tgaccatcga 600 

gctagccatt gtgggctccg acatgcagga agtcatcggc acggccattg cattcaatct 66 0 

gctctcagct ggacgaatcc cactctgggg tggcgtcctc atcaccatcg tggacacctt 72 0 

cttcttcctc ttcctcgata actacgggct gcggaagctg gaagcttttt ttggactcct 780 

tataaccatt atggccttga cctttggcta tgagtatgtg gtggcgcgtc ctgagcaggg 840 

agcgcttctt cggggcctgt tcctgccctc gtgcccgggc tgcggccacc ccgagctgct 900 

gcaggcggtg ggcattgttg gcgccatcat catgccccac aacatctacc tgcactcggc 960 

cctggtcaag tctcgagaga tagaccgggc ccgccgagcg gacatcagag aagccaacat 102 0 

gtacttcctg attgaggcca ccatcgccct gtccgtctcc tttatcatca acctctttgt 1080 

catggctgtc tttgggcagg ccttctacca gaaaaccaac caggctgcgt tcaacatctg 1140 

tgccaacagc agcctccacg actacgccaa gatcttcccc atgaacaacg ccaccgtggc 12 00 

cgtggacatt taccaggggg gcgtgatcct gggctgcctg ttcggccccg cggccctcta 12 60 

catctgggcc ataggtctcc tggcggctgg gcagagctcc accatgacgg gcacctacgc 13 20 

gggacagttc gtgatggagg gcttcctgag gctgcggtgg tcacgcttcg cccgtgtcct 13 80 

cctcacccgc tcctgcgcca tcctgcccac cgtgctcgtg gctgtcttcc gggacctgag 1440 

ggacttgtcg ggcctcaatg atctgctcaa cgtgctgcag agcctgctgc tcccgttcgc 15 00 

cgtgctgccc atcctcacgt tcaccagcat gcccaccctc atgcaggagt ttgccaatgg 15 60 

cctgctgaac aaggtcgtca cctcttccat catggtgcta gtctgcgcca tcaacctcta 1620 

cttcgtggtc agctatctgc ccagcctgcc ccaccctgcc tacttcggcc ttgcagcctt 1680 

gctggccgca gcctacctgg gcctcagcac ctacctggtc tggacctgtt gccttgccca 17 40 

cggagccacc tttctggccc acagctccca ccaccacttc ctgtatgggc tccttgaaga 1800 

ggaccagaaa ggggagacct ctggctaggc ccacaccagg gcctggctgg gagtggcatg 1860 

tatgacgtga ctggcctgct ggatgtggag ggggcgcgtg caggcagcag gatggagtgg 192 0 

gacagttcct gagaccagcc aacctggggg ctttagggac ctgctgtttc ctagcgcagc 1980 

catgtgatta ccctctgggt ctcagtgtcc tcatctgtaa aatggagacg ccaccaccct 2 040 

tgccatggag gttaagcact ttaacacagt gfcctggcact tgggacaaaa acaaacaaac 210 0 

aaacaaaaaa catttcaaaa ggtatttatt gagcacctgc aggcgtgacc tgacagccca 2160 

agggtgggtg gggtgagggc ttgaggactt gggcgggaca caggctccaa actggagctt 2220 

gaaatagtgt ctgatgaatg ttaaattatc tatctatcta tttatttatt tatttgagac 2280 

agggaaaggg tctccctctg ttgccaaggc tggagtgcag tggcgcaatc ttaactcatt 2340 

gcaacctcca ccttctgggt tcaagcgatt ctctttattc agccccggga gtggcgcgcg 2400 

ccaccacgcc cagctaattt gtgtattttc agcagagacg gggtttgcca tgctggccag 2460 

gctggtctcg aactgctgga ttcaagtgat ccgcccatct ccgtctccca aagtgctggg 2520 
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aattacaggc gtgagccacc aaaacccggc ctgattaaag ttaaataaat acg 2573 

<210> 28 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> D50402 
<400> 28 

tggaggttaa gcactttaac acagtgtctg gcacttggga caaaaacaaa caaacaaaca 60 

<210> 29 

<211> 3672 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> L27560 



<400> 29 
acatgtgcat 
ttggaaagac 
ctcatttaat 
aaaaaagtgg 
ctatgctttt 
caaagtagtt 
catattctcc 
tgagcaataa 
gtgtgttcat 
cgcgcctcag 
ccacaccaca 
ttctttcatc 
agcaggaggg 
gcaggagaac 
tgaagggaga 
aaaagccaat 
aaagagagga 
agccagggaa 
tcagtctaaa 
agcctgcaat 
tccctgcatc 
gattcaggga 
ctcaaccaat 
cttggacact 
tcatggcatg 
agcgaaaaag 
atctcctatg 
gagacaaagg 
catctctctc 
atcccaaacc 
gttatccaac 
agccaggaac 
gctctctctt 
tttgctgcca 
gtttccaaag 
ctcctgggca 
cctcccaaat 
ggaggaagac 
gcgagtatgg 



atttcattcc 
ttgttcttcc 
tcaatttgta 
gctcgtattt 
tctttgtttt 
cctatagggg 
taatacaacc 
actcagcttt 
ctgcacccac 
tccatcccat 
ttgagcccaa 
tgaacagaat 
agactgaagt 
agtaagatgg 
gggggctgac 
agaaatttcc 
aaatggggga 
ggtgagtaac 
ctggccatgc 
gagggactga 
ctccctgggg 
tgctgtcact 
gggtggcctt 
ggccaaggag 
acacagcgga 
caaatctgtt 
cccctggaca 
ggcccgatca 
gacagctgtc 
tcttccacgt 
taggatctgc 
tacaagctcc 
cacggactgt 
gccatcaggt 
ttcgtgcctg 
gctggtcact 
gggagttctg 
ggcacctctt 
caatctaagc 



ccaggcagac 
acggtgacta 
agtagcgcac 
atctacagga 
tttctttctt 
cattgaggag 
cgtaatagca 
tttgtggggc 
cgagccaggc 
tttagtcttt 
caggtccacg 
gtgcattttt 
aagagatggc 
atggttccgg 
tgttccattc 
ttagcttccc 
atgggttgca 
cttaggaggg 
tttggaaggg 
ggcccacgag 
tggaggatgg 
cggcagccag 
gtgactgttc 
accccttcgt 
gaccgcctcc 
tcaactgatg 
ggacagttgg 
aggcagccac 
ccctaaaccc 
cactgttctt 
caagtggata 
ccccacattc 
gtctggactt 
gggggattag 
cgaacccaaa 
ccccagagaa 
gctgagccca 
cccccatggc 
aggggtctgg 



attttttaga 
cagtacatgc 
gagcctctgt 
ctccatatag 
cctttctttc 
cttcctcatt 
tttttgcctg 
acagtactgg 
agaggccagc 
aaaccctcag 
atggatccac 
ggaagcctcc 
agagggagat 
ccagagtcga 
fcagctttggc 
caccatatgt 
aaatagaaat 
tgctagactt 
acaagactat 
tctttccagc 
aaggaaagct 
attccgaaac 
tttaaggctg 
gctgtggaca 
aacaacgaat 
ggaaccctat 
ctggggacag 
aaggccttga 
aactcacgtt 
tccacccatt 
ctggggtgcc 
ctcccagcct 
tgagcaggct 
agcctggtgt 
cctgtgagtc 
gctgggcctt 
gggtggggag 
agggtgtgag 
tctctttgac 



aatcaataca 
tgaagcgtgc 
gggggaggat 
tcatatatag 
aaaggtttgc 
ctgggaaaac 
cctcgaggca 
atttgacagt 
cctccgtggt 
gaagtcacag 
ctagtcccac 
ctcactctcc 
ggtggcaaaa 
tgtggggagg 
acaaagcagc 
attttcatgg 
gagcttaatc 
tagaagccag 
gtgctccgct 
tcttcctcca 
gggacaagca 
tcccattctc 
aagatatcca 
cagctctctt 
ttggggctac 
agctatagaa 
gagaagtgct 
cctgccgagt 
tctgtatgtc 
ctccctttgc 
actcccctga 
ggacctaatt 
tctgcccctt 
aagtgcgcca 
tcttctgcat 
catggacaca 
atcctgggaa 
ggaggcaggt 
tccaggctcg 



tgccccaata 

cgtttcagcc 

aggctgaaaa 

gcatataaat 

attaactttt 

tgagaaaacc 

gagtttcccg 

gattccccac 

gcacacagca 

tctccggaca 

cccagccttt 

atgctggcag 

aggtttagat 

aacagagggc 

agaaaggggg 

atttgagagg 

caggccgcag 

ataggaagaa 

gcccaccttc 

ttctggccag 

gggaacgcat 

caatgacttc 

ggaaaggggg 

cactctttgc 

gaagaggaat 

cttgggggct 

caatcttcat 

cagcatgccc 

ttaggccagt 

atcttgagca 

gaaaagactg 

cttgagaggg 

gcgttggctc 

gactcttccg 

gcaggagttt 

tggaactaag 

gggaggcact 

ttggaatggt 

ctttggccga 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 
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ctgtctgctc acccagagac cttggact cc ggactatcca tggctccgaa tctaagtgct 2400 

gcccactccc atgctcacac ccacagaagg tcttcccatc ccctttagat tcgtgcctca 246 0 

ctccaccagt gaggaagatg cctctgtctt tcccacgact gccaggagat agggaagccc 252 0 

agccaggact gaccctcctt cctccagcct gccctgaccc acctggcaaa gcagggcaca 258 0 

tggggaggaa gagactggaa cctttctttg acagccaggc ctagacagac aggcctgggg 2640 

acactggccc atgaggggag gaaggcaggc gcacgaggtc cagggaggcc cttttctgat 2700 

catgcccctt ctctcccacc ccatctcccc accaccacct ctgtggcctc catggtaccc 276 0 

ccacagggct ggcctcccct agagggtggg cctcaaccac ctcgtcccgc cacgcaccgg 2 82 0 

ttagtgagac agggctgcca cgcaaccgcc aagcccccct caaggtggga cagtaccccg 2880 

gacccatcca ctcactcctg agaggctccg gcccagaatg ggaacctcag agaagagctc 2 94 0 

taaggagaag aaaccccata gcgtcagaga ggatatgtct ggcttccaag agaaaggagg 3 000 

ctccgttttg caaagtggag gagggacgag ggacaggggt ttcaccagcc agcaacctgg 3060 

gccttgtact gtctgtgttt ttaaaaccac taaagtgcaa gaattacatt gcactgtttc 312 0 

tccacttttt attttctctt aggcttttgt ttctatttca aacatacttt cttggttttc 3180 

taatggagta tatagtttag tcatttcaca gactctggcc tcctctcctg aaatcctttt 3240 

ggatggggaa agggaaggtg gggagggtcc gaggggaagg ggaccccagc ttccctgtgc 33 00 

ccgctcaccc cactccacca gtccccggtc gccagccgga gtctcctctc taccgccact 33 6 0 

gtcacaccgt agcccacatg gatagcacag ttgtcagaca agattccttc agattccgag 342 0 

ttgctaccgg ttgttttcgt tgttgttgtt gttgtttttc tttttctttt tttttttgaa 3480 

gacagcaata accacagtac atattactgt agttctctat agttttacat acattcatac 3540 

cataactctg ttctctcctc ttttttgttt tcaactttaa aaacaaaaat aaacgatgat 3 6 00 

aatctttact ggtgaaaagg atggaaaaat aaatcaacaa atgcaaccag tttgtgagaa 3 6 60 
aaaaaaaaaa aa 3 672 

<210> 30 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> L27560 
<400> 30 

agcaacctgg gccttgtact gtctgtgttt ttaaaaccac taaagtgcaa gaattacatt 60 

<210> 31 

<211> 1416 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> Modi f i ed_bas e 
<222> 1 . . . 1416 
<223> n = a,c,g, or t 

<3 00> 

<308> M55914 
<400> 31 

aggaattccg gaattccgga attccgatgg atggaacaga aaataaatct aagtttggtg 6 0 

cgaacgccat tctgggggtg tcccttgccg tctgcaaagc tggtgccgtt gagaaggggg 12 0 

tcccctgtac cgccacatcg cgtacttggc tggcaacttc gaagtcatcc tgccagtccc 180 

ggcgttcaag tgtcatcatc aatggcggtt ctcatgctgg caacaagctg gccatgcaga 240 

gtctgtcctc ccagtcggtg cagcaaactc agggaagcca tgccgcattg gagcagaggt 3 00 

ttaccacaac ctgaagaatg tcatcaagga gaaatatggg aaagatgcca ccaatgtggg 3 60 

gatttgcgcg ggtttgctcc caacatcctg gagaataaag aaggcctgga gctgctgaag 420 

actgctattg gaaagcctgg cctacactgt aaaggtggtc atggcatgga cgtagcggcc 480 

tccgagttct tcaggtcagg gaactatgac ctggacttca agtctcccga tgaccccagc 540 
aggtacatct cgcctgacca gctggctgac ctgtacaagt ccttcatcaa ggactaccca 600 
gtggtgtcta tcgaagatcc ctttgaccag gatgactggg gagcttcaga agttcacagc 660 
cagtgcagga atccaggtag tggggggatg actcacagtg accaacccaa agaggatcgc 720 
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caaggcgtga acgagaagtc ctgcaactgc ctcctgctca aagtcaacca gattggctcc 780 

gtgaccgagt ctcttcaggc gtgcaagctg gcccaggcca atggttgggg cgtcatggtg 840 

tctcatcgtt cgggggagac tgaagatacc ttcatcgctg acctggttgt ggggctgtgc 900 

actggggcag atcaagactg gtgccccttg ccgatcacgc gcttggccaa gtacaaccag 9 60 

ctcctcagaa ttgaagagga gctgggcagc aaggctaagt ttgccggcag gaacttcaga 102 0 

aaccccttgg ccaagtaagc tgtgggcagg caagccttcg gtcacctgtt ggctacagac 1080 

ccctcccctg gtgtcagctc aggcagctcg aggcccccga ccaacacttg caggggtccc 1140 

tgctagttag cgcccaccgc cgtggagttc gtaccgcttc cttagaactc tacagaagcc 1200 

aagctccctg gaagccctgt tggcagctct agctttgcag ttgtgtaatt ggcccaagtc 12 60 

attgtttttc tcgccttact ttccaccaag tgtctagagt catgtgagcc tngtgtcatc 13 2 0 

tccggggtgg ccacaggcta gatccccggt ggttttgtgc tcaaaataaa aagcctcagt 13 80 
gacccatgaa aaaaaaaaag gaattccgga attccg 1416 

<210> 32 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> M55914 



<400> 32 

gtaccgcttc cttagaactc tacagaagcc aagctccctg gaagccctgt tggcagctct 60 

<210> 33 

<211> 2517 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> M96577 
<400> 33 

ggaattccgt ggccgggact ttgcaggcag cggcggccgg gggcggagcg ggatcgagcc 60 

ctcgccgagg cctgccgcca tgggcccgcg ccgccgccgc cgcctgtcac ccgggccgcg 12 0 

cgggccgtga gcgtcatggc cttggccggg gcccctgcgg gcggcccatg cgcgccggcg 180 

ctggaggccc tgctcggggc cggcgcgctg cggctgctcg actcctcgca gatcgtcatc 240 

atctccgccg cgcaggacgc cagcgccccg ccggctccca ccggccccgc ggcgcccgcc 300 

gccggcccct gcgaccctga cctgctgctc ttcgccacac cgcaggcgcc ccggcccaca 3 60 

cccagtgcgc cgcggcccgc gctcggccgc ccgccggtga agcggaggct ggacctggaa 42 0 

actgaccatc agtacctggc cgagagcagt gggccagctc ggggcagagg ccgccatcca 480 

ggaaaaggtg tgaaatcccc gggggagaag tcacgctatg agacctcact gaatctgacc 54 0 

accaagcgct tcctggagct gctgagccac tcggctgacg gtgtcgtcga cctgaactgg 600 

gctgccgagg tgctgaaggt gcagaagcgg cgcatctatg acatcaccaa cgtccttgag 660 

ggcatccagc tcattgccaa gaagtccaag aaccacatcc agtggctggg cagccacacc 72 0 

acagtgggcg tcggcggacg gcttgagggg ttgacccagg acctccgaca gctgcaggag 7 80 

agcgagcagc agctggacca cctgatgaat atctgtacta cgcagctgcg cctgctctcc 840 

gaggacactg acagccagcg cctggcctac gtgacgtgtc aggaccttcg tagcattgca 90 0 

gaccctgcag agcagatggt tatggtgatc aaagcccctc ctgagaccca gctccaagcc 9 60 

gtggactctt cggagaactt tcagatctcc cttaagagca aacaaggccc gatcgatgtt 102 0 

ttcctgtgcc ctgaggagac cgtaggtggg atcagccctg ggaagacccc atcccaggag 1080 

gtcacttctg aggaggagaa cagggccact gactctgcca ccatagtgtc accaccacca 1140 

tcatctcccc cctcatccct caccacagat cccagccagt ctctactcag cctggagcaa 12 00 

gaaccgctgt tgtcccggat gggcagcctg cgggctcccg tggacgagga ccgcctgtcc 1260 

ccgctggtgg cggccgactc gctcctggag catgtgcggg aggacttctc cggcctcctc 132 0 

cc tgaggagt tcatcagcct ttccccaccc cacgaggccc tcgactacca cttcggcctc 13 80 

gaggagggcg agggcatcag agacctcttc gactgtgact ttggggacct cacccccctg 144 0 

gatttctgac agggcttgga gggaccaggg tttccagagt agctcacctt gtctctgcag 1500 

ccctggagcc ccctgtccct ggccgtcctc ccagcctgtt tggaaacatt taatttatac 1560 

ccctctcctc tgtctccaga agcttctagc tctggggtct ggctaccgct aggaggctga 162 0 
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gcaagccagg 
gtgtgtaccg 
tgaacataca 
atgagtccat 
ggtcccacaa 
tggggaggct 
ttttctgatt 
ggggaatcag 
gagcaagggc 
ggtgatttat 
tcagatggtg 
tggatatgag 
gaaggcctcc 
cccactgctc 
cctcctgctt 



aagggaagga 
ggggtgaatg 
cctctgtgtg 
ctctgcgcgt 
ggcccagggc 
ttggctggct 
gaagctttaa 
caaaagggga 
aggggtccct 
ttattgggaa 
gggtgggccc 
atgggagagg 
cccagcccag 
tgccccaccc 
tggttttaat 



gtctgtgtgg 
tgtgtgagca 
tgcactgcag 

gggggggctc 

agtgcctgct 
gggcgtgtag 
tggagcgtta 
ggaggggtgt 
gagctgttct 
agtgagggag 
tctccagggg 
tgagtggggg 
accctgtggt 
tccaatctgc 
aaatattttg 



tgtgtatgtg 
tgtgtgtgtg 
acacgcccca 
taactgcact 
cccagaatct 
gacggtgaga 
tttatttatc 
ggggttgata 
tctgccccat 
ggagacagac 
gccagttcag 
accttcactg 
ccctcctgca 
actttgattt 
atgacgttaa 



catgcagcct 
catgtaccgg 
gtgtgtccac 
ttcggccctt 
ggtgctctga 
gcacttctgt 
gaggcctctt 
ccccaactcc 
actgaaggaa 
tgactgacag 
ggcccagctg 
atgtgggcag 
gtgtctgaag 
gcttcctaac 
aaaaaggaat 



acacccacac 
ggaatgaagg 
atgtgtgtgc 
ttgctcgtgg 
ccaggccagg 
cttaaaggtt 
tggtgagcct 
ctctaccctt 
ctgaggcctg 
ccatgggtgg 
ccccccagga 
gaggggtggt 
cgcctgcctc 
agctctgttc 
tcgatat 2 



1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
517 



<210> 34 
<211> 60 
<212> DNA 

<213> Homo sapiens 



<300> 

<308> M96577 



<400> 34 

gtaggacggt gagagcactt ctgtcttaaa ggttttttct gattgaagct ttaatggagc 6 0 

<210> 35 

<211> 4437 

<212> DNA 

<213> Homo sapiens 



<300> 

<308> NM_000057 



<400> 35 

gcgcggcggc cgtggttgcg gcgcgggaag tttggatcct ggttccgtcc gctaggagtc 60 

tgcgtgcgag gattatggct gctgttcctc aaaataatct acaggagcaa ctagaacgtc 12 0 

actcagccag aacacttaat aataaattaa gtctttcaaa accaaaattt tcaggtttca 180 

cttttaaaaa gaaaacatct tcagataaca atgtatctgt aactaatgtg tcagtagcaa 240 

aaacacctgt attaagaaat aaagatgtta atgttaccga agacttttcc ttcagtgaac 3 00 

ctctacccaa caccacaaat cagcaaaggg tcaaggactt ctttaaaaat gctccagcag 3 60 

gacaggaaac acagagaggt ggatcaaaat cattattgcc agatttcttg cagactccga 42 0 

aggaagttgt atgcactacc caaaacacac caactgtaaa gaaatcccgg gatactgctc 480 

tcaagaaatt agaatttagt tcttcaccag attctttaag taccatcaat gattgggatg 540 

atatggatga ctttgatact tctgagactt caaaatcatt tgttacacca ccccaaagtc 600 

actttgtaag agtaagcact gctcagaaat caaaaaaggg taagagaaac ttttttaaag 660 

cacagcttta tacaacaaac acagtaaaga ctgatttgcc tccaccctcc tctgaaagcg 72 0 

agcaaataga tttgactgag gaacagaagg atgactcaga atggttaagc agcgatgtga 7 80 

tttgcatcga tgatggcccc attgctgaag tgcatataaa tgaagatgct caggaaagtg 840 

actctctgaa aactcatttg gaagatgaaa gagataatag cgaaaagaag aagaatttgg 9 00 

aagaagctga attacattca actgagaaag ttccatgtat tgaatttgat gatgatgatt 9 60 

atgatacgga ttttgttcca ccttctccag aagaaattat ttctgcttct tcttcctctt 102 0 

caaaatgcct tagtacgtta aaggaccttg acacatctga cagaaaagag gatgttctta 1080 

gcacatcaaa agatcttttg tcaaaacctg agaaaatgag tatgcaggag ctgaatccag 1140 

aaaccagcac agactgtgac gctagacaga taagtttaca gcagcagctt attcatgtga 1200 

tggagcacat ctgtaaatta attgatacta ttcctgatga taaactgaaa cttttggatt 1260 

gtgggaacga actgcttcag cagcggaaca taagaaggaa acttctaacg gaagtagatt 132 0 

ttaataaaag tgatgccagt cttcttggct cattgtggag atacaggcct gattcacttg 13 80 

atggccctat ggagggtgat tcctgcccta cagggaattc tatgaaggag ttaaattttt 1440 

cacaccttcc ctcaaattct gtttctcctg gggactgttt actgactacc accctaggaa 1500 
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agacaggatt 
tacagaagtc 
aaagctctta 
atactgcttc 
ttgacataga 
ccagcaaatc 
tatcagaaag 
atataaactt 
ccagaaatct 
tgaagatttt 
tcaatgctgc 
gtttgtgtta 
tgagatcact 
atctgacagg 
aagacccaat 
tcatttctac 
aagcacattg 
tgcttcgcca 
gggtacagaa 
gctttaacag 
ttgattgcct 
tctccaggcg 
ttgcttacca 
atcaggatgg 
cggacgtgcg 
aatctggcag 
atgatgtgac 
gagaaactca 
gcaggagaat 
agaaacaccc 
atgtgactga 
gaatgagaaa 
acattttctt 
cttattcacg 
atgaagactt 
aagcccaaac 
gcagtgtgaa 
aaaaatgtct 
attacttcaa 
atcctgaggt 
cggaagtgat 
ccccagggat 
acgaggaaat 
ggaaaaagat 
aggcaaaggg 
tcattggatc 
tggggattat 
tctcataaca 
catctgtgac 



ctctgccacc 
ctttgtaagt 
tttcccagga 
aataaatgac 
tgactttgat 
ttccacagct 
actttcctca 
ctcagagtca 
gaaacatgag 
tcataaaaaa 
actgcttggt 
ccagctccct 
tatcgtagat 
tgataagact 
cataaaactt 
tctggagaat 
tgtcagtcag 
gaagtttcct 
ggacatcctg 
acataatctg 
agaatggatc 
agaatgtgac 
tgctggcctc 
ctgtcaggtt 
atttgtgatt 
agctggaaga 
cagactgaaa 
cttcaataat 
acagcttttg 
agatgtttct 
cgatgtgaaa 
tataaaacat 
ggggagtaag 
acacaatgcc 
atatatcaat 
tgtactaaat 
aaaacaaaaa 
tggagaactt 
tatttttaat 
tttgcttcaa 
ttcagtatta 
aagcctgtcc 
acccgtatct 
gccagcctcc 
ggggtctgcc 
cagttcagcc 
ggctccaccg 
accgaatctc 
tataaagctg 



aggaagaatc 
agcaactggg 
aatgttctca 
ttagaaagag 
gatgatgatg 
gcctatcaac 
gccaagacag 
attcagaatt 
cgtttccaaa 
tttggcctgc 
gaagactgtt 
gcctgtgttt 
caagtccaaa 
gactcagaag 
ctatatgtca 
ctctatgaga 
tggggacatg 
tctgttccgg 
actcagctga 
aaatactatg 
agaaagcacc 
accatggctg 
agtgattctg 
atctgtgcta 
catgcatctc 
gatggggaaa 
agacttataa 
ttgtatagca 
gcctactttg 
tgtgataatt 
agtattgtaa 
gtaggtcctt 
agtgcaaaaa 
gaaagacttt 
gccaatgacc 
ggcaatttaa 
gcgttagtag 
acagaagtct 
accgtcactc 
attgatggtg 
cagaaatact 
agcagcagag 
tcccactact 
caaaggtcta 
acatgtagaa 
tcacatactt 
aagcctataa 
aatgtacata 
ttattcttgt 



tttttgaaag 
ctgaaacacc 
caagcactgc 
aaacccaacc 
actgggaaga 
ccatcaagga 
actgtcttcc 
atactgacaa 
gtcttagttt 
ataattttag 
ttatcctgat 
ctcctggggt 
agctgacttc 
ctacaaatat 
ctccagaaaa 
ggaagctctt 
attttcgtca 
tgatggctct 
agattctcag 
tattaccgaa 
acccatatga 
acacgttaca 
ccagagatga 
caattgcatt 
tccctaaatc 
tatctcactg 
tgatggaaaa 
tggtacatta 
gtgaaaatgg 
gctgtaaaac 
gatttgttca 
ctggaagatt 
tccagtcagg 
ttaaaaagct 
aggcgatcgc 
aggtagactt 
caaaagtgtc 
gcaaatctct 
tcaagaagct 
ttactgaaga 
ctgaatggac 
gccccggaag 
ttgcaagtaa 
agaggagaaa 
agatatcttc 
ctcaagcgac 
atagaccgtt 
gaccctcttt 
tataccaaaa 



gcctttattc 
aagactagga 
tgtgaaagat 
ttcctatgat 
cataatgcat 
aggtcggcca 
agtgtcatct 
gtcagcacaa 
tcctcataca 
aactaatcag 
gccgactgga 
cactgttgtc 
cttggatatt 
ttacctccag 
gatctgtgca 
ggcacgtttt 
agattacaaa 
tacggccaca 
acctcaggtg 
aaagcctaaa 
ttcagggata 
gagagatggg 
agtgcagcag 
tggaatgggg 
tgtggagggt 
cctgcttttc 
agatggaaac 
ctgtgaaaat 
atttaatcct 
aaaggattat 
agaacatagt 
tactatgaat 
tatatttgga 
gatacttgac 
ttatgtgatg 
tatggaaaca 
tcagagggaa 
ggggaaagtt 
tgcagaatct 
caaactggaa 
atcgccagct 
aagtgccgct 
aaccagaaat 
aactgcttcc 
caaaacgaaa 
atcaggagcc 
tcttaagcct 
cttgtttgtc 
aaaaaaaaaa 



aatacccatt 1560 

aaaaaaaatg 1620 

cagaat aaac 1680 

attgataatt 1740 

aatttagcag 1800 

attaaatcag 1860 

actgctcaaa 1920 

aatttagcat 1980 

aaggaaatga 2 040 

ctagaggcga 2100 

ggtggtaaga 216 0 

atttctccct 2220 

ccagctacat 2280 

ttatcaaaaa 23 40 

agtaacagac 24 00 

gttattgatg 24 6 0 

agaatgaata 252 0 

gctaatccca 2580 

tttagcatga 2 640 

aaggtggcat 27 0 0 

atttactgcc 2760 

ctcgctgctc 2 82 0 

aagtggatta 2880 

attgacaaac 2 940 

tactaccaag 3 000 

tatacctatc 3 060 

catcatacaa 3120 

ataacggaat 3180 

gatttttgta 3240 

aaaacaagag 33 00 

tcatcacaag 3 3 60 

atgctggtcg 3 42 0 

aaaggatctg 3480 

aagattttgg 3 540 

ctcggaaata 3600 

gaaaattcca 366 0 

gagatggtta 372 0 

tttggtgtcc 3780 
ttatcttctg 3840 
aaatatggtg 3900 
gaagacagtt 3960 
gaggagcttg 402 0 
gaaaggaaga 4080 
agtggttcca 4140 
tcctccagca 4200 
aatagcaaat 42 60 
tcatatgcat 43 2 0 
agcatctgac 43 80 
aaaaaaa 4437 



<210> 36 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM__000057 



taagccttca tatgcattct cataacaacc gaatctcaat gtacatagac octet ttctt 60 
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240 
300 



<210> 37 
<211> 2016 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> NM_000060 
<400> 37 

gccagctgga gcgttttcgg ggctgtaaag ggagaatggc gcatgcgcat attcagggcg 60 

gaaggcgcgc taagagcaga tttgtggtct gcattatgtc tggagccaga agtaagcttg 12 0 

ctcttttcct ctgcggctgt tacgtggttg ccctgggagc ccacaccggg gaggagagcg 180 
tggctgacca tcacgaggct gaatattatg tggctgccgt gtatgagcat ccatccatcc 
tgagtctgaa ccctctggct ctcatcagcc gccaagaggc cttggagctc atgaacpaga 

accttgacat ctatgaacag caagtgatga ctgcagccca aaaggatgta cagattatag 3 60 

tgtttccaga agatggcatt catggattca actttacaag aacatccatt tatccatttt 42 0 

tggacttcat gccgtctccc caggtggtca ggtggaaccc atgcctggag cctcaccgct 480 

tcaatgacac agaggtgctc cagcgcctga gttgtatggc catcagggga gatatgttct 540 

tggtggccaa tcttgggaca aaggagcctt gtcatagcag tgacccaagg tgcccaaaag 6 00 

atgggagata ccagttcaac acaaatgtcg tgttcagcaa taatggaacc cttgttgacc 66 0 

gctaccgtaa acacaacctc tactttgagg cagcattcga tgttcctctt aaagtggatc 72 0 

tcatcacctt tgataccccc tttgctggca ggtttggcat cttcacatgc tttgatatat 7 80 

tgttctttga ccctgccatc agagtcctca gagactacaa ggtgaagcat gttgtgtacc 840 

caactgcctg gatgaaccag ctcccactct tggcagcaat tgagattcag aaagcttttg 900 

ctgttgcctt tggcatcaac gttctggcag ctaatgtcca ccacccagtt ctggggatga 9 60 

caggaagtgg catacacacc cctctggagt ccttttggta ccatgacatg gaaaatccca 1020 

aaagtcacct tataattgcc caggtggcca aaaatccagt gggtctcatt ggtgcagaga 1080 

atgcaacagg tgaaacggac ccatcccata gtaagttttt aaaaattttg tcaggcgatc 1140 

cgtactgtga gaaggatgct caggaagt cc actgtgatga ggccaccaag tggaacgtga 12 00 

atgctcctcc cacatttcac tctgagatga tgtatgacaa tttcaccctg gtccctgtct 12 6 0 

ggggaaagga aggctatctc cacgtctgtt ccaatggcct ctgctgttat ttactttacg 132 0 

agaggcccac cttatccaaa gagctgtatg ccctgggggt ctttgatggg cttcacacag 13 80 

tacatggcac ttactacatc caagtgtgtg ccctggtcag gtgtgggggt cttggcttcg 1440 

acacctgcgg acaggaaatc acagaggcca cggggatatt tgagtttcac ctgtggggca 1500 

acttcagtac ttcctatatc tttcctttgt ttctgacctc agggatgacc ctagaagtcc 1560 

ctgaccagct tggctgggag aatgaccact atttcctgag gaaaagtagg ctgtcctctg 162 0 

ggctggtgac ggcggctctc tatgggcgct tgtatgagag ggactaggaa aagtgtgtgg 1680 

tctgtggggc ggactctggc catcatgttg acagccttgc acttccacag gctacaagcc 174 0 

ctgggaccat ctttctgcct taagggcagg agcccacttc tgtggcacca gattccaccc 1800 

tgggaactgt ggaaaaagta ggagaggcag attccctcag tgtcttcctc ttaaacctca 186 0 

atcatcgaga cattaggggg tattttctgt tcacatttat ctttttcaag ccacatcttc 192 0 

ctctaacaaa tctctcagta tgcgattggt ctcaagctaa aacaaaaata aatgtcagtt 1980 
tatattttac acatccaaaa aaaaaaaaaa aaaaaa 2 016 

<210> 38 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_000060 



<400> 38 

tcctctaaca aatctctcag tatgcgattg gtctcaagct aaaacaaaaa taaatgtcag 

<210> 39 

<211> 811 

<212> DNA 

<213> Homo sapiens 



60 
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<300> 

<308> NHL.000269 
<400> 39 

gcagaagcgt tccgtgcgtg caagtgctgc gaaccacgtg ggtcccgggc gcgtttcggg 60 

tgctggcggc tgcagccgga gttcaaacct aagcagctgg aaggaaccat ggccaactgt 12 0 

gagcgtacct tcattgcgat caaaccagat ggggtccagc ggggtcttgt gggagagatt 180 

atcaagcgtt ttgagcagaa aggattccgc cttgttggtc tgaaattcat gcaagcttcc 240 

gaagatcttc tcaaggaaca ctacgttgac ctgaaggacc gtccattctt tgccggcctg 3 00 

gtgaaataca tgcactcagg gccggtagtt gccatggtct gggaggggct gaatgtggtg 3 60 

aagacgggcc gagtcatgct cggggagacc aaccctgcag actccaagcc tgggaccatc 42 0 

cgtggagact tctgcataca agttggcagg aacattatac atggcagtga ttctgtggag 480 

agtgcagaga aggagatcgg cttgtggttt caccctgagg aactggtaga ttacacgagc 540 

tgtgctcaga actggatcta tgaatgacag gagggcagac cacattgctt ttcacatcca 600 

tttcccctcc ttcccatggg cagaggacca ggctgtagga aatctagtta tttacaggaa 6 60 

cttcatcata atttggaggg aagctcttgg agctgtgagt tctccctgta cagtgttacc 72 0 

atccccgacc atctgattaa aatgcttcct cccagcatag gattcattga gttggttact 7 80 
tcatattgtt gcattgcttt tttttccttc t 811 

<210> 40 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_000269 
<400> 40 

gtctgaaatt catgcaagct tccgaagatc ttctcaagga acactacgtt gacctgaagg 60 

<210> 41 

<211> 2338 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_000291 
<400> 41 

agcgcacgtc ggcagtcggc tccctcgttg accgaatcac cgacctctct ccccagctgt 6 0 

atttccaaaa tgtcgctttc taacaagctg acgctggaca agctggacgt taaagggaag 12 0 

cgggtcgtta tgagagtcga cttcaatgtt cctatgaaga acaaccagat aacaaacaac 180 

cagaggatta aggctgctgt cccaagcatc aaattctgct tggacaatgg agccaagtcg 240 

gtagtcctta tgagccacct aggccggcct gatggtgtgc ccatgcctga caagtactcc 3 00 

ttagagccag ttgctgtaga actcaaatct ctgctgggca aggatgttct gttcttgaag 3 60 

gactgtgtag gcccagaagt ggagaaagcc tgtgccaacc cagctgctgg gtctgtcatc 42 0 

ctgctggaga acctccgctt tcatgtggag gaagaaggga agggaaaaga tgcttctggg 480 

aacaaggtta aagccgagcc agccaaaata gaagctttcc gagcttcact ttccaagcta 54 0 

ggggatgtct atgtcaatga tgcttttggc actgctcaca gagcccacag ctccatggta 600 

ggagtcaatc tgccacagaa ggctggtggg tttttgatga agaaggagct gaactacttt 660 

gcaaaggcct tggagagccc agagcgaccc ttcctggcca tcctgggcgg agctaaagtt 72 0 

gcagacaaga tccagctcat caataatatg ctggacaaag tcaatgagat gattattggt 7 80 

ggtggaatgg cttttacctt ccttaaggtg ctcaacaaca tggagattgg cacttctctg 840 

tttgatgaag agggagccaa gattgtcaaa gacctaatgt ccaaagctga gaagaatggt 900 

gtgaagatta ccttgcctgt tgactttgtc actgctgaca agtttgatga gaatgccaag 9 60 

actggccaag ccactgtggc ttctggcata cctgctggct ggatgggctt ggactgtggt 102 0 

cctgaaagca gcaagaagta tgctgaggct gtcactcggg ctaagcagat tgtgtggaat 1080 

ggtcctgtgg gggtatttga atgggaagct tttgcccggg gaaccaaagc tctcatggat 1140 

gaggtggtga aagccacttc taggggctgc atcaccatca taggtggtgg agacactgcc 12 0 0 

acttgctgtg ccaaatggaa cacggaggat aaagtcagcc atgtgagcac tgggggtggt 1260 

gccagtttgg agctcctgga aggtaaagtc cttcctgggg tggatgctct cagcaatatt 132 0 



21 



WO 2005/086891 



PCT/US2005/007894 



tagtactttc ctgcctttta gttcctgtgc acagccccta agtcaactta gcattttctg 13 80 

catctccact tggcattagc taaaaccttc catgtcaaga ttcagctagt ggccaagaga 1440 

tgcagtgcca ggaaccctta aacagttgca cagcatctca gctcatcttc actgcaccct 15 00 

ggatttgcat acattcttca agatcccatt tgaatttttt agtgactaaa ccattgtgca 1560 

ttctagagtg catatattta tattttgcct gttaaaaaga aagtgagcag tgttagctta 1620 

gttctctttt gatgtaggtt attatgatta gctttgtcac tgtttcacta ctcagcatgg 1680 

aaacaagatg aaattccatt tgtaggtagt gagacaaaat tgatgatcca ttaagtaaac 1740 

aataaaagtg tccattgaaa ccgtgatttt tttttttttc ctgtcatact ttgttaggaa 1800 

gggtgagaat agaatcttga ggaacggatc agatgtctat attgctgaat gcaagaagtg 1860 

gggcagcagc agtggagaga tgggacaatt agataaatgt ccattcttta tcaagggcct 1920 



1980 



actttatggc agacattgtg ctagtgcttt tattctaact tttattttta tcagttacac 

atgatcataa tttaaaaagt caaggcttat aacaaaaaag ccccagccca ttcctcccat 2 040 

tcaagattcc cactccccag aggtgaccac tttcaactct tgagtttttc aggtatatac 2100 

ctccatgttt ctaagtaata tgcttatatt gttcacttcc ttttttttta ttttttaaag 2160 

aaatctattt cataccatgg aggaaggctc tgttccacat atatttccac ttcttcattc 2220 

tctcggtata gttttgtcac aattatagat tagatcaaaa gtctacataa ctaatacagc^ .2 80 
tgagctatgt agtatgctat gattaaattt acttatgtaa aaaaaaaaaa aaaaaaaa 



2338 



<210> 42 
<211> 60 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> NM_000291 



acttagcatt ttctgcatct ccacttggca ttagctaaaa ccttccatgt caagattcag 6 0 

<210> 43 
<211> 787 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> NM_000363 

<400> 43 cn 
ctgaaggtca cccgggcggc cccctcactg accctccaaa cgcccctgtc ctcgccctgc 60 
ctcctgccat tcccggcctg agtctcagca tggcggatgg gagcagcgat gcggctaggg 120 
aacctcgccc tgcaccagcc ccaatcagac gccgctcctc caactaccgc gcttatgcca 180 
cggagccgca cgccaagaaa aaatctaaga tctccgcctc gagaaaattg cagctgaaga 
ctctgctgct gcagattgca aagcaagagc tggagcgaga ggcggaggag cggcgcggag 
agaaggggcg cgctctgagc acccgctgcc agccgctgga gttgaccggg ctgggcttcg 3 60 
cggagctgca ggacttgtgc cgacagctcc acgcccgtgt ggacaaggtg gatgaagaga 420 
gatacgacat agaggcaaaa gtcaccaaga acatcacgga gattgcagat ctgactcaga 480 
agatctttga ccttcgaggc aagtttaagc ggcccaccct gcggagagtg aggatctctg 540 
cagatgccat gatgcaggcg ctgctggggg cccgggctaa ggagtccctg gacctgcggg 600 
cccacctcaa gcaggtgaag aaggaggaca ccgagaagga aaaccgggag gtgggagact 660 
ggcggaagaa catcgatgca ctgagtggaa tggagggccg caagaaaaag tttgagagct ...n 
gagccttcct gcctactgcc cctgccctga ggagggccac tgaggaataa agcttctctc 
tgagctg 787 



<210> 44 
<211> 60 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> NM_000363 



240 
300 



720 
780 
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<400> 44 

tgtggacaag gtggatgaag agagatacga catagaggca aaagt caeca agaacatcac 60 

<210> 45 

<211> 1263 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_000365 
<400> 45 

ggcacgagac cttcagcgcc tcggctccag cgccatggcg ccctccagga agttcttcgt 60 

tgggggaaac tggaagatga aegggeggaa gcagagtctg ggggagctca tcggcactct 120 

gaacgcggcc aaggtgccgg ccgacaccga ggtggtttgt gctcccccta ctgectatat 180 

cgacttcgcc eggcagaage tagatcccaa gattgctgtg getgegcaga actgetacaa 240 

agtgactaat ggggctttta ctggggagat cagccctggc atgatcaaag actgeggage 300 

cacgtgggtg gtcctggggc actcagagag aaggcatgtc tttggggagt cagatgagct 3 60 

gattgggcag aaagtggccc atgctctggc agagggactc ggagtaatcg ectgeattgg 42 0 

ggagaagcta gatgaaaggg aagctggcat cactgagaag gttgttttcg agcagacaaa 480 

ggtcatcgea gataacgtga aggactggag caaggtegtc ctggcctatg agcctgtgtg 540 

ggccattggt actggcaaga ctgcaacacc ccaacaggcc caggaagtac acgagaagct 600 

ccgaggatgg ctgaagtcca aegtctctga tgcggtggct cagagcaccc gtatcattta 660 

tggaggctct gtgactgggg caacctgcaa ggagctggcc agccagcctg atgtggatgg 72 0 

cttccttgtg ggtggtgctt ccctcaagcc egaattegtg gacatcatca atgecaaaca 7 80 

atgagcccca tccatcttcc ctacccttcc tgccaagcca gggactaagc ageccagaag 840 

cccagtaact gccctttccc tgeatatget tctgatggtg tcatctgctc cttcctgtgg 900 

cctcatccaa actgtatctt cctttactgt ttatatcttc accctgtaat ggttgggacc 960 

aggccaatcc cttctccact tactataatg gttggaacta aacgt caeca aggtggcttc 1020 

tccttggctg agagatggaa ggcgtggtgg gatttgetec tgggttccct aggecctagt 1080 

gagggcagaa gagaaaccat cctctccctt cttacaccgt gaggecaaga tcccctcaga 1140 

aggcaggagt gctgccctct cccatggtgc ccgtgcctct gtgctgtgta tgtgaaccac 1200 

ccatgtgagg gaataaacct ggcactagga aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 12 60 
aaa 12 63 

<210> 46 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_000365 
<400> 46 

tatcttcacc ctgtaatggt tgggaccagg ccaatccctt ctccacttac tataatggtt 60 

<210> 47 

<211> 1616 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_000582 
<400> 47 

ctccctgtgt tggtggagga tgtctgeage agcatttaaa ttctgggagg gcttggttgt 60 
cagcagcagc aggaggaggc agagcacagc ategteggga ccagactcgt ctcaggccag 120 
ttgeagcett ctcagccaaa cgccgaccaa ggaaaactca ctaccatgag aattgcagtg 180 
atttgetttt gcctcctagg catcacctgt gccataccag ttaaacaggc tgattctgga 240 
agttctgagg aaaagcagct ttacaacaaa tacccagatg ctgtggccac atggctaaac 300 
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cctgacccat ctcagaagca gaatctccta gccccacaga cccttccaag taagtccaac 3 60 

gaaagccatg accacatgga tgatatggat gatgaagatg atgatgacca tgtggacagc 420 

caggactcca ttgactcgaa cgactctgat gatgtagatg acactgatga ttctcaccag 480 

tctgatgagt ctcaccattc tgatgaatct gatgaactgg tcactgattt tcccacggac 540 

ctgccagcaa ccgaagtttt cactccagtt gtccccacag tagacacata tgatggccga 600 

ggtgatagtg tggtttatgg actgaggtca aaatctaaga agtttcgcag acctgacatc 660 

cagtaccctg atgctacaga cgaggacatc acctcacaca tggaaagcga ggagttgaat 72 0 

ggtgcataca aggccatccc cgttgcccag gacctgaacg cgccttctga ttgggacagc 780 

cgtgggaagg acagttatga aacgagtcag ctggatgacc agagtgctga aacccacagc 840 

cacaagcagt ccagattata taagcggaaa gccaatgatg agagcaatga gcattccgat 900 

gtgattgata gtcaggaact ttccaaagtc agccgtgaat tccacagcca tgaatttcac 9 60 

agccatgaag atatgctggt tgtagacccc aaaagtaagg aagaagataa acacctgaaa 102 0 

tttcgtattt ctcatgaatt agatagtgca tcttctgagg tcaattaaaa ggagaaaaaa 1080 

tacaatttct cactttgcat ttagtcaaaa gaaaaaatgc tttatagcaa aatgaaagag 1140 

aacatgaaat gcttctttct cagtttattg gttgaatgtg tatctatttg agtctggaaa 12 00 

taactaatgt gtttgataat tagtttagtt tgtggcttca tggaaactcc ctgtaaacta 12 60 

aaagcttcag ggttatgtct atgttcattc tatagaagaa atgcaaacta tcactgtatt 132 0 

ttaatatttg ttattctctc atgaatagaa atttatgtag aagcaaacaa aatactttta 13 80 

cccacttaaa aagagaatat aacattttat gtcactataa tcttttgttt tttaagttag 1440 

tgtatatttt gttgtgatta tctttttgtg gtgtgaataa atcttttatc ttgaatgtaa 1500 

taagaatttg gtggtgtcaa ttgcttattt gttttcccac ggttgtccag caattaataa 15 60 
aacataacct tttttactgc ctaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaa 1616 

<210> 48 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_000582 
<400> 48 

ggtggtgtca attgcttatt tgttttccca cggttgtcca gcaattaata aaacataacc 60 

<210> 49 

<211> 1666 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_000584 
<400> 49 

ctccataagg cacaaacttt cagagacagc agagcacaca agcttctagg acaagagcca 60 

ggaagaaacc accggaagga accatctcac tgtgtgtaaa catgacttcc aagctggccg 120 

tggctctctt ggcagccttc ctgatttctg cagctctgtg tgaaggtgca gttttgccaa 180 

ggagtgctaa agaacttaga tgtcagtgca taaagacata ctccaaacct ttccacccca 2 40 

aatttatcaa agaactgaga gtgattgaga gtggaccaca ctgcgccaac acagaaatta 3 00 

ttgtaaagct ttctgatgga agagagctct gtctggaccc caaggaaaac tgggtgcaga 3 60 

gggttgtgga gaagtttttg aagagggctg agaattcata aaaaaattca ttctctgtgg 420 

tatccaagaa tcagtgaaga tgccagtgaa acttcaagca aatctacttc aacacttcat 480 

gtattgtgtg ggtctgttgt agggttgcca gatgcaatac aagattcctg gttaaatttg 540 

aatttcagta aacaatgaat agtttttcat tgtaccatga aatatccaga acatacttat 600 

atgtaaagta ttatttattt gaatctacaa aaaacaacaa ataattttta aatataagga 660 

ttttcctaga tattgcacgg gagaatatac aaatagcaaa attgaggcca agggccaaga 72 0 

gaatatccga actttaattt caggaattga atgggtttgc tagaatgtga tatttgaagc 780 

atcacataaa aatgatggga caataaattt tgccataaag tcaaatttag ctggaaatcc 840 

tggatttttt tctgttaaat ctggcaaccc tagtctgcta gccaggatcc acaagtcctt 900 

gttccactgt gccttggttt ctcctttatt tctaagtgga aaaagtatta gccaccatct 960 

tacctcacag tgatgttgtg aggacatgtg gaagcacttt aagttttttc atcataacat 102 0 

aaattatttt caagtgtaac ttattaacct atttattatt tatgtattta tttaagcatc 1080 
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aaatatttgt gcaagaattt ggaaaaatag aagatgaatc attgattgaa tagttataaa 1140 

gatgttatag taaatttatt ttattttaga tattaaatga tgttttatta gataaatttc 12 00 

aatcagggtt tttagattaa acaaacaaac aattgggtac ccagttaaat tttcatttca 12 60 

gataaacaac aaataatttt ttagtataag tacattattg tttatctgaa attttaattg 132 0 

aactaacaat cctagtttga tactcccagt cttgtcattg ccagctgtgt tggtagtgct 13 80 

gtgttgaatt acggaataat gagttagaac tattaaaaca gccaaaactc cacagtcaat 1440 

attagtaatt tcttgctggt tgaaacttgt ttattatgta caaatagatt cttataatat 1500 

tatttaaatg actgcatttt taaatacaag gctttatatt tttaacttta agatgttttt 1560 

atgtgctctc caaatttttt ttactgtttc tgattgtatg gaaatataaa agtaaatatg 162 0 
aaacatttaa aatataattt gttgtcaaag taaaaaaaaa aaaaaa 16 6 6 

<210> 50 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> N]yt_000584 
<400> 50 

tggtagtgct gtgttgaatt acggaataat gagttagaac tattaaaaca gccaaaactc 60 

<210> 51 

<211> 1722 

<212> DNA 

<213> Homo sapiens 



<300> 

<308> NM_000599 
<400> 51 

ggggaaaaga gctaggaaag agctgcaaag 
ttttcattac ccctcctccg ttttcaccct 
tcgaagagga ggtggcaaag tgggagaaaa 
ttgtttttgt tttttaattt cttgatttca 
aacgcctctt acctgttctg cggcgccgcg 
ggtgtatttt agattttaag caaaaatttt 
caacgccatc tccactgcat ccgatctcat 
tttgtggctt tttttcccct ataattctga 
ccgctcactg cgtgcacctg gcgctgccct 
tcttgcaact gggacctgct cgcaggcacc 
aagtgtctgg gggagggcac ctgctctacc 
aaaaaaatct ccgggggccc tcttggcccc 
accccgaggt aaagggggcg actaagagaa 
gctggccgcc tatgcggggc cggcccagag 
cgacgagaaa gccctctcca tgtgcccccc 
gccgggctgc ggctgctgca tgacctgcgc 
caccgagcgc tgcgcccagg ggctgcgctg 
gcacgccctg ctgcacggcc gcggggtttg 
caagatcgag agagactccc gtgagcacga 
gacctactcc cccaagatct tccggcccaa 
agcagtgaag aaggaccgca gaaagaagct 
gaacactgcc cacccccgga tcatctctgc 
cccctgccgc agacacatgg aggcttccct 
gccccgtgct gtgtacctgc ccaattgtga 
caaaccttcc cgtggccgca agcgtggcat 
gctgccaggc atggagtacg ttgacgggga 
cgttgagtga tgcgtccccc cccaaccttt 
tccagccagc gcctccctcc accccaggac 
atatatatct atctatttga ggaaaaaaaa 



cagtgtgggc tttttccctt tttttgctcc 60 

tctccggact tcgcgtagaa cctgcgaatt 12 0 

gaggtgttag ggtttggggt ttttttgttt 180 

acattttctc ccaccctctc ggctgcagcc 240 

caccgctggc agctgagggt tagaaagcgg 300 

aaagataaat ccatttttct ctcccacccc 3 60 

tatttcggtg gttgcttggg ggtgaacaat 42 0 

cccgctcagg cttgagggtt tctccggcct 480 

gcttccccca acctgttgca aggctttaat 540 

ccagccctcc acctctctct acatttttgc 600 

tgccagaaat tttaaaacaa aaacaaaaac 660 

tttatccctg cactctcgct ctcctgcccc 720 

gatggtgttg ctcaccgcgg tcctcctgct 780 

cctgggctcc ttcgtgcact gcgagccctg 840 

cagccccctg ggctgcgagc tggtcaagga 9 00 

cctggccgag gggcagtcgt gcggcgtcta 9 60 

cctcccccgg caggacgagg agaagccgct 1020 

cctcaacgaa aagagctacc gcgagcaagt 1080 

ggagcccacc acctctgaga tggccgagga 1140 

acacacccgc atctccgagc tgaaggctga 12 00 

gacccagtcc aagtttgtcg ggggagccga 12 60 

acctgagatg agacaggagt ctgagcaggg 132 0 

gcaggagctc aaagccagcc cacgcatggt 1380 

ccgcaaagga ttctacaaga gaaagcagtg 1440 

ctgctggtgc gtggacaagt acgggatgaa 1500 

ctttcagtgc cacaccttcg acagcagcaa 1560 

ccctcacccc ctcccacccc cagccccgac 1620 

gccactcatt tcatctcatt taagggaaaa 1680 
aaaaaaaaaa aa 1722 
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<210> 52 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_000599 
<400> 52 

ccaggacgcc actcatttca tctcatttaa gggaaaaata tatatctatc tatttgagga 60 

<210> 53 

<211> 704 

<212> DNA 

<213> Homo sapiens 



<300> 

<308> NM_000735 



<400> 53 

gcagttactg agaactcata agacgaagct aaaatccctc ttcggatcca cagtcaaccg 60 

ccctgaacac atcctgcaaa aagcccagag aaaggagcgc catggattac bacagaaaat 12 0 

atgcagctat ctttctggtc acattgtcgg tgtttctgca tgttctccat tccgctcctg 180 

atgtgcagga ttgcccagaa tgcacgctac aggaaaaccc attcttctcc cagccgggtg 240 

ccccaatact tcagtgcatg ggctgctgct tctctagagc atatcccact ccactaaggt 3 00 

ccaagaagac gatgttggtc caaaagaacg tcacctcaga gtccacttgc tgtgtagcta 3 60 

aatcatataa cagggtcaca gtaatggggg gtttcaaagt ggagaaccac acggcgtgcc 42 0 

actgcagtac ttgttattat cacaaatctt aaatgtttta ccaagtgctg tcttgatgac 480 

tgctgatttt ctggaatgga aaattaagtt gtttagtgtt tatggctttg tgagataaaa 540 

ctctcctttt ccttaccata ccactttgac acgcttcaag gatatactgc agctttactg 600 

ccttcctcct tatcctacag tacaatcagc agtctagttc ttttcatttg gaatgaatac 660 
agcattaagc ttgttccact gcaaataaag ccttttaaat catc 7 04 



<210> 54 
<211> 60 
<212> DNA 
<213> Homo 



sapiens 



<300> 

<308> NM_000735 



<400> 54 

tgagataaaa ctctcctttt ccttaccata ccactttgac acgcttcaag gatatactgc 60 

<210> 55 

<211> 1342 

<212> DNA 

<213> Homo sapiens 



<300> 

<308> NM_000799 



<400> 55 

cccggagccg gaccggggcc accgcgcccg 
ccgccctctc ctccaggccc gtggggctgg 
gcccccggtg tggtcacccg gcgcgcccca 
gatgggggtg cacgaatgtc ctgcctggct 
tctgggcctc ccagtcctgg gcgccccacc 
gaggtacctc ttggaggcca aggaggccga 
cagcttgaat gagaatatca ctgtcccaga 



ctctgctccg acaccgcgcc ccctggacag 60 

ccctgcaccg ccgagcttcc cgggatgagg 12 0 

ggtcgctgag ggaccccggc caggcgcgga 180 

gtggcttctc ctgtccctgc tgtcgctccc 240 

acgcctcatc tgtgacagcc gagtcctgga 300 

gaatatcacg acgggctgtg ctgaacactg 360 

caccaaagtt aatttctatg cctggaagag 42 0 
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gatggaggtc gggcagcagg ccgtagaagt ctggcagggc ctggccctgc tgtcggaagc 480 

tgtcctgcgg ggccaggccc tgttggtcaa ctcttcccag ccgtgggagc ccctgcagct 540 

gcatgtggat aaagccgtca gtggccttcg cagcctcacc actctgcttc gggctctgcg 600 

agcccagaag gaagccatct cccctccaga tgcggcctca gctgctccac tccgaacaat 660 

cactgctgac actttccgca aactcttccg agtctactcc aatttcctcc ggggaaagct 720 

gaagctgtac acaggggagg cctgcaggac aggggacaga tgaccaggtg tgtccacctg 7 80 

ggcatatcca ccacctccct caccaacatt gcttgtgcca caccctcccc cgccactcct 840 

gaaccccgtc gaggggctct cagctcagcg ccagcctgtc ccatggacac tccagtgcca 900 

gcaatgacat ctcaggggcc agaggaactg tccagagagc aactctgaga tctaaggatg 960 

tcacagggcc aacttgaggg cccagagcag gaagcattca gagagcagct ttaaactcag 102 0 

ggacagagcc atgctgggaa gacgcctgag ctcactcggc accctgcaaa atttgatgcc 108 0 

aggacacgct ttggaggcga tttacctgtt ttcgcaccta ccatcaggga caggatgacc 1140 

tggagaactt aggtggcaag ctgtgacttc tccaggtctc acgggcatgg gcactccctt 12 00 

ggtggcaaga gcccccttga caccggggtg gtgggaacca tgaagacagg atgggggctg 12 60 

gcctctggct ctcatggggt ccaagttttg tgtattcttc aacctcattg acaagaactg 1320 
aaaccaccaa aaaaaaaaaa aa 13 42 

<210> 56 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_000799 
<400> 56 

tcatggggtc caagttttgt gtattcttca acctcattga caagaactga aaccaccaaa 60 

<210> 57 

<211> 2722 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_000917 
<400> 57 

gagcgggctg agggtaggaa gtagccgctc cgagtggagg cgactggggg ctgaagagcg 60 

cgccgccctc tcgtcccact ttccaggtgt gtgatcctgt aaaattaaat cttccaagat 12 0 

gatctggtat atattaatta taggaattct gcttccccag tctttggctc atccaggctt 180 

ttttacttca attggtcaga tgactgattt gatccatact gagaaagatc tggtgacttc 240 

tctgaaagat tatattaagg cagaagagga caagttagaa caaataaaaa aatgggcaga 3 00 

gaagttagat cggctaacta gtacagcgac aaaagatcca gaaggatttg ttgggcatcc 3 60 

agtaaatgca ttcaaattaa tgaaacgtct gaatactgag tggagtgagt tggagaatct 42 0 

ggtccttaag gatatgtcag atggctttat ctctaaccta accattcaga gaccagtact 480 

ttctaatgat gaagatcagg ttggggcagc caaagctctg ttacgtctcc aggataccta 540 

caatttggat acagatacca tctcaaaggg taatcttcca ggagtgaaac acaaatcttt 600 

tctaacggct gaggactgct ttgagttggg caaagtggcc tatacagaag cagattatta 660 

ccatacggaa ctgtggatgg aacaagccct aaggcaactg gatgaaggcg agatttctac 72 0 

catagataaa gtctctgttc tagattattt gagctatgcg gtatatcagc agggagacct 7 80 

ggataaggca cttttgctca caaagaagct tcttgaacta gatcctgaac atcagagagc 840 

taatggtaac ttaaaatatt ttgagtatat aatggctaaa gaaaaagatg tcaataagtc 900 

tgcttcagat gaccaatctg atcagaaaac tacaccaaag aaaaaagggg ttgctgtgga 960 

ttacctgcca gagagacaga agtacgaaat gctgtgccgt ggggagggta tcaaaatgac 102 0 

ccctcggaga cagaaaaaac tcttttgccg ctaccatgat ggaaaccgta atcctaaatt 1080 

tattctggct ccagctaaac aggaggatga atgggacaag cctcgtatta ttcgcttcca 1140 

tgatattatt tctgatgcag aaattgaaat cgtcaaagac ctagcaaaac caaggctgag 1200 

ccgagctaca gtacatgacc ctgagactgg aaaattgacc acagcacagt acagagtatc 126 0 

taagagtgcc tggctctctg gctatgaaaa tcctgtggtg tctcgaatta atatgagaat 1320 

acaagatcta acaggactag atgtttccac agcagaggaa ttacaggtag caaattatgg 1380 

agttggagga cagtatgaac cccattttga ctttgcacgg aaagatgagc cagatgcttt 1440 
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caaagagctg gggacaggaa atagaattgc tacatggctg ttttatatga gtgatgtgtc 1500 

tgcaggagga gccactgttt ttcctgaagt tggagctagt gtttggccca aaaaaggaac 156 0 

tgctgttttc tggtataatc tgtttgccag tggagaagga gattatagta cacggcatgc 162 0 

agcctgtcca gtgctagttg gcaacaaatg ggtatccaat aaatggctcc atgaacgtgg 1680 

acaagaattt cgaagacctt gtacgttgtc agaattggaa tgacaaacag gcttcccttt 1740 

ttctcctatt gttgtactct tatgtgtctg atatacacat ttccatagtc ttaactttca 1800 

ggagtttaca attgactaac actccatgat tgattcagtc atgaacctca tcccatgttt 1860 

catctgtgga caattgctta ctttgtgggt tcttttaaaa gtaacacgaa atcatcatat 192 0 

tgcataaaac cttaaagttc tgttggtatc acagaagaca aggcagagtt taaagtgagg 1980 

aattttatat ttaaagaact ttttggttgg ataaaaacat aatttgagca tccagtttta 2040 

gtatttcact acatctcagt tggtgggtgt taagctagaa tgggctgtgt gataggaaac 2100 

aaatgcctta cagatgtgcc taggtgttct gtttacctag tgtcttactc tgttttctgg 2160 

atctgaagac tagtaataaa ctaggacact aactgggttc catgtgattg ccctttcata 222 0 

tgatcttcta agttgatttt tttcctccca agtctttttt aaagaaagta tactgtattt 22 80 

taccaacccc ctctcttttc ttttagctcc tctgtggtga attaaacgta cttgagttaa 2340 

aatatttcga tttttttttt ttttttaatg gaaagtcctg cataacaaca ctgggccttc 2400 

ttaactaaaa tgctcaccac ttagcctgtt tttttatccc ttttttaaaa tgacagatga 2460 

ttttgttcag gaattttgct gtttttctta gtgctaatac cttgcctctt attcctgcta 2 52 0 

cagcagggtg gtaatattgg cattctgatt aaatactgtg cc ttaggaga ctggaagttt 2 580 

aaaaatgtac aagtcctttc agtgatgagg gaattgattt tttttaaaag tctttttctt 2 640 

agaaagccaa aatgtttgtt tttttaagat tctgaaatgt gttgtgacaa caatgaccta 2 700 
tttatgatct taaatctttt tt 2722 

<210> 58 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_000917 
<400> 58 

tcttactctg ttttctggat ctgaagacta gtaataaact aggacactaa ctgggttcca 6 0 

<210> 59 

<211> 3236 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> 3Sny[_001109 
<400> 59 

gacccggcca tgcgcggcct cgggctctgg ctgctgggcg cgatgatgct gcctgcgatt 60 
gcccccagcc ggccctgggc cctcatggag cagtatgagg tcgtgttgcc gcggcgtctg 120 
ccaggccccc gagtccgccg agctctgccc tcccacttgg gcctgcaccc agagagggtg 180 
agctacgtcc ttggggccac agggcacaac ttcaccctcc acctgcggaa gaacagggac 2 40 
ctgctgggtt ccggctacac agagacctat acggctgcca atggctccga ggtgacggag 3 00 
cagcctcgcg ggcaggacca ctgcttatac cagggccacg tagaggggta cccggactca 3 60 
gccgccagcc tcagcacctg tgccggcctc aggggtttct tccaggtggg gtcagacctg 420 
cacctgatcg agcccctgga tgaaggtggc gagggcggac ggcacgccgt gtaccaggct 480 
gagcacctgc tgcagacggc cgggacctgc ggggtcagcg acgacagcct gggcagcctc 5 40 
ctgggacccc ggacggcagc cgtcttcagg cctcggcccg gggactctct gccatcccga 600 
gagacccgct acgtggagct gtatgtggtc gtggacaatg cagagttcca gatgctgggg 660 
agcgaagcag ccgtgcgtca tcgggtgctg gaggtggtga atcacgtgga caagctatat 720 
cagaaactca acttccgtgt ggtcctggtg ggcctggaga tttggaatag tcaggacagg 7 80 
ttccacgtca gccccgaccc cagtgtcaca ctggagaacc tcctgacctg gcaggcacgg 840 
caacggacac ggcggcacct gcatgacaac gtacagctca tcacgggtgt cgacttcacc 900 
gggactactg tggggtttgc cagggtgtcc gccatgtgct cccacagctc aggggctgtg 960 
aaccaggacc acagcaagaa ccccgtgggc gtggcctgca ccatggccca tgagatgggc 102 0 
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cacaacctgg gcatggacca tgatgagaac gtccagggct gccgctgcca ggaacgcttc 108 0 

gaggccggcc gctgcatcat ggcaggcagc attggctcca gtttccccag gatgttcagt 1140 

gactgcagcc aggcctacct ggagagcttt ttggagcggc cgcagtcggt gtgcctcgcc 1200 

aacgcccctg acctcagcca cctggtgggc ggccccgtgt gtgggaacct gtttgtggag 12 6 0 

cgtggggagc agtgcgactg cggccccccc gaggactgcc ggaaccgctg ctgcaactct 132 0 

accacctgcc agctggctga gggggcccag tgtgcgcacg gtacctgctg ccaggagtgc 13 8 0 

aaggtgaagc cggctggtga gctgtgccgt cccaagaagg acatgtgtga cctcgaggag 1440 

ttctgtgacg gccggcaccc tgagtgcccg gaagacgcct tccaggagaa cggcacgccc 1500 

tgctccgggg gctactgcta caacggggcc tgtcccacac tggcccagca gtgccaggcc 15 6 0 

ttctgggggc caggtgggca ggctgccgag gagtcctgct tctcctatga catcctacca 162 0 

ggctgcaagg ccagccggta cagggctgac atgtgtggcg ttctgcagtg caagggtggg 1680 

cagcagcccc tggggcgtgc catctgcatc gtggatgtgt gccacgcgct caccacagag 1740 

gatggcactg cgtatgaacc agtgcccgag ggcacccggt gtggaccaga gaaggtttgc 1800 

fcggaaaggac gttgccagga cttacacgtt tacagatcca gcaactgctc tgcccagtgc 186 0 

cacaaccatg gggtgtgcaa ccacaagcag gagtgccact gccacgcggg ctgggccccg 192 0 

ccccactgcg cgaagctgct gactgaggtg cacgcagcgt ccgggagcct ccccgtcctc 1980 

gtggtggtgg ttctggtgct cctggcagtt gtgctggtca ccctggcagg catcatcgtc 2040 

taccgcaaag cccggagccg catcctgagc aggaacgtgg ctcccaagac cacaatgggg 2100 

cgctccaacc ccctgttcca ccaggctgcc agccgcgtgc cggccaaggg cggggctcca 2160 

gccccatcca ggggccccca agagctggtc cccaccaccc acccgggcca gcccgcccga 2220 

cacccggcct cctcggtggc tctgaagagg ccgccccctg ctcctccggt cactgtgtcc 22 80 

agcccaccct tcccagttcc tgtctacacc cggcaggcac caaagcaggt catcaagcca 2340 

acgttcgcac ccccagtgcc cccagtcaaa cccggggctg gtgcggccaa ccctggtcca 240 0 

gctgagggtg ctgttggccc aaaggttgcc ctgaagcccc ccatccagag gaagcaagga 2460 

gccggagctc ccacagcacc ctaggggggc acctgcgcct gtgtggaaat ttggagaagt 252 0 

tgcggcagag aagccatgcg ttccagcctt ccacggtcca gctagtgccg ctcagcccta 2580 

gaccctgact ttgcaggctc agctgctgtt ctaacctcag taatgcatct acctgagagg 2 640 

ctcctgctgt ccacgccctc agccaattcc ttctccccgc cttggccacg tgtagcccca 2 7 00 

gctgtctgca ggcaccaggc tgggatgagc tgtgtgcttg cgggtgcgtg tgtgtgtacg 2760 

tgtctccagg tggccgctgg tctcccgctg tgttcaggag gccacatata cagcccctcc 2820 

cagccacacc tgcccctgct ctggggcctg ctgagccggc tgccctgggc acccggttcc 2 880 

aggcagcaca gacgtggggc atccccagaa agactccatc ccaggaccag gttcccctcc 2940 

gtgctcttcg agagggtgtc agtgagcaga ctgcacccca agctcccgac tccaggtccc 3 000 

ctgatcttgg gcctgtttcc catgggattc aagagggaca gccccagctt tgtgtgtgtt 3 0 60 

taagcttagg aatgcccttt atggaaaggg ctatgtggga gagtcagcta tcttgtctgg 3120 

ttttcttgag acctcagatg tgtgttcagc agggctgaaa gcttttattc tttaataatg 3180 
agaaatgtat attttactaa taaattattg accgagttct gtagattctt gttaga 32 3 6 

<210> 60 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_001109 
<400> 60 

ctttatggaa agggctatgt gggagagtca gctatcttgt ctggttttct tgagacctca 60 

<210> 61 

<211> 1449 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_001124 
<400> 61 

ctggatagaa cagctcaagc cttgccactt cgggcttctc actgcagctg ggcttggact 60 

tcggagtttt gccattgcca gtgggacgtc tgagactttc tccttcaagt acttggcaga 120 

tcactctctt agcagggtct gcgcttcgca gccgggatga agctggtttc cgtcgccctg 180 
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atgtacctgg 
gagtttcgaa 
tccagcagct 
cggccccagg 
cgcatccgag 
ggctgccgct 
gataaggaca 
cggcgccggc 
caagcacacg 
ggcgcccatg 
gacttcccga 
aggcaccgtc 
tccttagcct 
ttgccaggct 
tctgagccac 
gcgcaagcct 
ttgtacatac 
acttcaaata 
attatattgt 
ccgcgtggaa 
ggaagaagga 
cagcaaacc 



gttcgctcgc 
agaagtggaa 
accccaccgg 
acatgaaggg 
tcaagcgcta 
tcgggacgtg 
aggacaacgt 
gctccctgcc 
gggctccagc 
gtacaaggaa 
gcggtgtggg 
cggcggcgag 
tgctcaggtg 
taaggagagg 
agccgtgctc 
cactattact 
agaggtaact 
tagagatatt 
cctcccctat 
tgtgagtgtg 
aacaccgagt 
1449 



cttcctaggc 
fcaagtgggct 
gctcgctgac 
tgcctctcga 
ccgccagagc 
cacggtgcag 
cgcccccagg 
cgaggccggc 
ccccccgagt 
tagtcgcgca 
gaccgggctc 
ctctggcttt 
caagtgcccc 
agaaactgag 
gcccacaaac 
tgaactttcc 
atcaatattt 
tttgtacgtt 
tttaagacgt 
tttgtgtgca 
ctctgtataa 



gctgacaccg 
ctgagtcgtg 
gtgaaggccg 
agccccgaag 
atgaacaact 
aagctggcac 
agcaagatca 
ccgggtcgga 
ggaagtgctc 
agcatcccgc 
tgacagccct 
gcaagggccc 
agggggcggg 
aaatgaatgc 
tgatttctca 
aaaacctaaa 
aagtttgttg 
atatattgta 
gaatgtctca 
tgaaagagaa 
tctatttaca 



ctcggttgga 

ggaagaggga 

ggcctgccca 

acagcagtcc 

tccagggcct 

accagatcta 

gcccccaggg 

ctctggtgtc 

cccactttct 

tggtgcctcc 

gcggagaccc 

ctccttctgg 

gtgcagaaga 

tgagaccccc 

cggcgtgtca 

gaggaaaagt 

ctgtcaagat 

ttaagggcat 

gcgaggtgta 

agactgatta 

taaaatgggt 



tgtcgcgtcg 

actgcggatg 

gacccttatt 

ggatgccgcc 

ccggagcttt 

ccagttcaca 

ctacggccgc 

ttctaagcca 

ttaggattta 

cgggacgaag 

tgagtccggg 

gggcttcgct 

atccgagtgt 

ggagcagggg 

ccccaccagg 

gcaatgcgtg 

tttttttgta 

tttaaaagca 

aagttgttcg 

cctcctgtgt 

gatatgcgaa 



240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 



<210> 62 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM__001124 

gaagjaaaca ccgagtctct gtataatcta tttacataaa atgggtgata tgcgaacagc 60 

<210> 63 

<211> 1619 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_001168 



<400> 63 

ccgccagatt 

gacgttgccc 

ctggcccttc 

ccactgcccc 

ggaaggctgg 

cgctttcctt 

ggacagagaa 

tgaggaaact 

cctctggccg 

gtgccaccag 

caaattagat 

tgcctgtgca 

gggggctcat 

aaggcagtgt 

gtgaatgtgt 

ggtgcctgtt 

acagtttttt 

gtgatgagag 



tgaatcgcgg 

cctgcctggc 

ttggagggct 

actgagaacg 

gagccagatg 

tctgtcaaga 

agagccaaga 

gcgaagaaag 

gagctgcctg 

ccttcctgtg 

gtttcaactg 

gcgggtgctg 

ttttgctgtt 

cccttttgct 

ctggacctca 

gaatctgagc 

tgttgttgtg 

aatggagaca 



gacccgttgg 

agccctttct 

gcgcctgcac 

agccagactt 

acgaccccat 

agcagtttga 

acaaaattgc 

tgcgccgtgc 

gtcccagagt 

ggccccttag 

tgctcctgtt 

ctggtaacag 

ttgattcccg 

agagctgaca 

tgttgttgag 

tgcaggttcc 

tttttttgtt 

gagtccctgg 



cagaggtggc 

caaggaccac 

cccggagcgg 

ggcccagtgt 

agaggaacat 

agaattaacc 

aaaggaaacc 

catcgagcag 

ggctgcacca 

caatgtctta 

ttgtcttgaa 

tggctgcttc 

ggcttaccag 

gctttgttcg 

gctgtcacag 

ttatctgtca 

tttttttttt 

ctcctctact 



ggcggcggca 

cgcatctcta 

atggccgagg 

ttcttctgct 

aaaaagcatt 

cttggtgaat 

aacaataaga 

ctggctgcca 

cttccagggt 

ggaaaggaga 

agtggcacca 

tctctctctc 

gtgagaagtg 

cgtgggcaga 

tcctgagtgt 

cacctgtgcc 

ggtagatgca 

gtttaacaac 



tgggtgcccc 

cattcaagaa 

ctggcttcat 

tcaaggagct 

cgtccggttg 

ttttgaaact 

agaaagaatt 

tggattgagg 

ttattccctg 

tcaacatttt 

gaggtgcttc 

tctctttttt 

agggaggaag 

gccttccaca 

ggacttggca 

tcctcagagg 

tgacttgtgt 

atggctttct 



60 

120 

180 

240 

300 

360 

42 0 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 
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tattttgttt gaattgttaa ttcacagaat agcacaaact acaattaaaa ctaagcacaa 1140 

agccattcta agtcattggg gaaacggggt gaacttcagg tggatgagga gacagaatag 12 00 

agtgatagga agcgtctggc agatactcct tttgccactg ctgtgtgatt agacaggccc 12 60 

agtgagccgc ggggcacatg ctggccgctc ctccctcaga aaaaggcagt ggcctaaatc 132 0 

ctttttaaat gacttggctc gatgctgtgg gggactggct gggctgctgc aggccgtgtg 13 80 

tctgtcagcc caaccttcac atctgtcacg ttctccacac gggggagaga cgcagtccgc 1440 

ccaggtcccc gctttctttg gaggcagcag ctcccgcagg gctgaagtct ggcgtaagat 1500 

gatggatttg attcgccctc ctccctgtca tagagctgca gggtggattg ttacagcttc 1560 
gctggaaacc tctggaggtc atctcggctg ttcctgagaa ataaaaagcc tgtcatttc 1619 

<210> 64 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_001168 
<400> 64 

ttcacagaat agcacaaact acaattaaaa ctaagcacaa agccattcta agtcattggg 60 

<210> 65 

<211> 1552 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_001216 
<400> 65 

gcccgtacac accgtgtgct gggacacccc acagtcagcc gcatggctcc cctgtgcccc 60 

agcccctggc tccctctgtt gatcccggcc cctgctccag gcctcactgt gcaactgctg 12 0 

ctgtcactgc tgcttctgat gcctgtccat ccccagaggt tgccccggat gcaggaggat 180 

tcccccttgg gaggaggctc ttctggggaa gatgacccac tgggcgagga ggatctgccc 240 

agtgaagagg attcacccag agaggaggat ccacccggag aggaggatct acctggagag 3 00 

gaggatctac ctggagagga ggatctacct gaagttaagc ctaaatcaga agaagagggc 3 60 

tccctgaagt tagaggatct acctactgtt gaggctcctg gagatcctca agaaccccag 42 0 

aataatgccc acagggacaa agaaggggat gaccagagtc attggcgcta tggaggcgac 480 

ccgccctggc cccgggtgtc cccagcctgc gcgggccgct tccagtcccc ggtggatatc 540 

cgcccccagc tcgccgcctt ctgcccggcc ctgcgccccc tggaactcct gggcttccag 6 00 

ctcccgccgc tcccagaact gcgcctgcgc aacaatggcc acagtgtgca actgaccctg 660 

cctcctgggc tagagatggc tctgggtccc gggcgggagt accgggctct gcagctgcat 72 0 

ctgcactggg gggctgcagg tcgtccgggc tcggagcaca ctgtggaagg ccaccgtttc 7 80 
cctgccgaga tccacgtggt tcacctcagc accgcctttg ccagagttga cgaggccttg 84 0 
gggcgcccgg gaggcctggc cgtgttggcc gcctttctgg aggagggccc ggaagaaaac 900 
agtgcctatg agcagttgct gtctcgcttg gaagaaatcg ctgaggaagg ctcagagact 960 
caggtcccag gactggacat atctgcactc ctgccctctg acttcagccg ctacttccaa 1020 
tatgaggggt ctctgactac accgccctgt gcccagggtg tcatctggac tgtgtttaac 10 80 
cagacagtga tgctgagtgc taagcagctc cacaccctct ctgacaccct gtggggacct 1140 
ggtgactctc ggctacagct gaacttccga gcgacgcagc ctttgaatgg gcgagtgatt 12 00 
gaggcctcct tccctgctgg agtggacagc agtcctcggg ctgctgagcc agtccagctg 12 60 
aattcctgcc tggctgctgg tgacatccta gccctggttt ttggcctcct ttttgctgtc 13 20 
accagcgtcg cgttccttgt gcagatgaga aggcagcaca gaaggggaac caaagggggt 13 80 
gtgagctacc gcccagcaga ggtagccgag actggagcct agaggctgga tcttggagaa 1440 
tgtgagaagc cagccagagg catctgaggg ggagccggta actgtcctgt cctgctcatt 15 00 
atgccacttc cttttaactg ccaagaaatt ttttaaaata aatatttata at 1552 

<210> 66 

<211> 60 

<212> DNA 

<213> Homo sapiens 
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<300> 

<308> NM_001216 



<400> 66 

tcctgtcctg ctcattatgc cacttccttt taactgccaa gaaatttttt aaaataaata 6 0 



<210> 67 

<211> 2653 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_001254 



<400> 67 

gagcgcggct ggagtttgct gctgccgctg tgcagtttgt tcaggggctt gtggtggtga 6 0 

gtccgagagg ctgcgtgtga gagacgtgag aaggatcctg cactgaggag gtggaaagaa 12 0 

gaggattgct cgaggaggcc tggggtctgt gaggcagcgg agctgggtga aggctgcggg 180 

ttccggcgag gcctgagctg tgctgtcgtc atgcctcaaa cccgatccca ggcacaggct 2 40 

acaatcagtt ttccaaaaag gaagctgtct cgggcattga acaaagctaa aaactccagt 3 00 

gatgccaaac tagaaccaac aaatgtccaa accgtaacct gttctcctcg tgtaaaagcc 3 60 

ctgcctctca gccccaggaa acgtctgggc gatgacaacc tatgcaacac tccccattta 42 0 

cctccttgtt ctccaccaaa gcaaggcaag aaagagaatg gtccccctca ctcacataca 48 0 

cttaagggac gaagattggt atttgacaat cagctgacaa ttaagtctcc tagcaaaaga 54 0 

gaactagcca aagttcacca aaacaaaata ctttcttcag ttagaaaaag tcaagagatc 600 

acaacaaatt ctgagcagag atgtccactg aagaaagaat ctgcatgtgt gagactattc 660 

aagcaagaag gcacttgcta ccagcaagca aagctggtcc tgaacacagc tgtcccagat 72 0 

cggctgcctg ccagggaaag ggagatggat gtcatcagga attbcttgag ggaacacatc 7 80 

tgtgggaaaa aagctggaag cctttacctt tctggtgctc ctggaactgg aaaaactgcc 840 

tgcttaagcc ggattctgca agacctcaag aaggaactga aaggctttaa aactatcatg 9 00 

ctgaattgca tgtccttgag gactgcccag gctgtattcc cagctattgc tcaggagatt 9 60 

tgtcaggaag aggtatccag gccagctggg aaggacatga tgaggaaatt ggaaaaacat 1020 

atgactgcag agaagggccc catgattgtg ttggtattgg acgagatgga tcaactggac 10 80 

agcaaaggcc aggatgtatt gtacacgcta tttgaatggc catggctaag caattctcac 1140 

ttggtgctga ttggtattgc taataccctg gatctcacag atagaattct acctaggctt 12 00 

caagctagag aaaaatgtaa gccacagctg ttgaacttcc caccttatac cagaaatcag 12 60 

atagtcacta ttttgcaaga tcgacttaat caggtatcta gagatcaggt tctggacaat 1320 

gctgcagttc aattctgtgc ccgcaaagtc tctgctgttt caggagatgt tcgcaaagca 13 80 

ctggatgttt gcaggagagc tattgaaatt gtagagtcag atgtcaaaag ccagactatt 1440 

ctcaaaccac tgtctgaatg taaatcacct tctgagcctc tgattcccaa gagggttggt 1500 

cttattcaca tatcccaagt catctcagaa gttgatggta acaggatgac cttgagccaa 15 60 

gaaggagcac aagattcctt ccctcttcag cagaagatct tggtttgctc tttgatgctc 1620 

ttgatcaggc agttgaaaat caaagaggtc acfcctgggga agttatatga agcctacagt 16 80 

aaagtctgtc gcaaacagca ggtggcggct gtggaccagt cagagtgttt gtcactttca 1740 

gggctcttgg aagccagggg cattttagga ttaaagagaa acaaggaaac ccgtttgaca 1800 

aaggtgtttt tcaagattga agagaaagaa atagaacatg ctctgaaaga taaagcttta 1860 

attggaaata tcttagctac tggattgcct taaattcttc tcttacaccc cacccgaaag 192 0 

tattcagctg gcatttagag agctacagtc ttcattttag tgctttacac attcgggcct 19 80 

gaaaacaaat atgacctttt ttacttgaag ccaatgaatt ttaatctata gattctttaa 2 040 

tattagcaca gaataatatc tttgggtctt actattttta cccataaaag tgaccaggta 2100 

gacccttttt aattacattc actacttcta ccacttgtgt atctctagcc aatgtgcttg 2160 

caagtgtaca gatctgtgta gaggaatgtg tgtatattta cctcttcgtt tgchcaaaca 222 0 

fcgagtgggta tttttttgtt tgtttttttt gttgttgttg tttttgaggc gcgtctcacc 2 2 80 

ctgttgccca ggctggagtg caatggcgcg ttctctgctc actacagcac ccgcttccca 2340 

ggttgaagtg attctcttgc ctcagcctcc cgagtagctg ggattacagg tgcccaccac 2 400 

cgcgcccagc taatttttta atttttagta gagacagggt tttaccatgt tggccaggct 2 460 

ggtcttgaac tcctgaccct caagtgatct gcccaccttg gcctccctaa gtgctgggat 2 520 

tataggcgtg agccaccatg ctcagccatt aaggtatttt gttaagaact ttaagtttag 2 5 80 

ggtaagaaga atgaaaatga tccagaaaaa tgcaagcaag tccacatgga gatttggagg 2 640 
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60 



acactggtta aag 2 653 

<210> 68 
<211> 60 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> NM_001254 
<400> 68 

caaggaaacc cgtttgacaa aggtgttttt caagattgaa gagaaagaaa tagaacatgc 

<210> 69 
<211> 627 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> NM_001323 
<400> 69 

gcggccgcaa gctcggcact cacggctctg agggctccga cggcactgac ggccatggcg 
cgttcgaacc tcccgctggc gctgggcctg gccctggtcg cattctgcct cctggcgctg 120 
ccacgcgacg cccgggcccg gccgcaggag cgcatggtcg gagaactccg ggacctgtcg 180 
cccgacgacc cgcaggtgca gaaggcggcg caggcggccg tggccagcta caacatgggc 240 
agcaacagca tctactactt ccgagacacg cacatcatca aggcgcagag ccagctggtg 3 00 
gccggcatca agtacttcct gacgatggag atggggagca cagactgccg caagaccagg 3 60 
gtcactggag accacgtcga cctcaccact tgccccctgg cagcaggggc gcagcaggag 42 0 
aagctgcgct gtgactttga ggtccttgtg gttccctggc agaactcctc tcagctccta 4 80 
aagcacaact gtgtgcagat gtgataagtc cccgagggcg aaggccattg ggtttggggc 540 
catggtggag ggcacttcag gtccgtgggc cgtatctgtc acaataaatg gccagtgctg 6 00 
cttcttgcaa aaaaaaaaaa aaaaaaa 627 

<210> 70 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM__001323 



60 



<400> 70 

atcaagtact tcctgacgat ggagatgggg agcacagact gccgcaagac cagggtcact 

<210> 71 

<211> 1812 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_001428 



60 



60 



<400> 71 

tagctaggca ggaagtcggc gcgggcggcg cggacagtat ctgtgggtac ccggagcacg 

gagatctcgc cggctttacg ttcacctcgg tgtctgcagc accctccgct tcctctccta 120 

ggcgacgaga cccagtggct agaagttcac catgtctatt ctcaagatcc atgccaggga 180 

gatctttgac tctcgcggga atcccactgt tgaggttgat ctcttcacct caaaaggtct 240 

cttcagagct gctgtgccca gtggtgcttc aactggtatc tatgaggccc tagagctccg 3 00 

ggacaatgat aagactcgct atatggggaa gggtgtctca aaggctgttg agcacatcaa 3 60 

taaaactatt gcgcctgccc tggttagcaa gaaactgaac gtcacagaac aagagaagat 420 
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tgacaaactg 
cattctgggg 
gtaccgccac 
caatgtcatc 
cctcccagtc 
caacctgaag 
aggcgggttt 
tattgggaaa 
gttcttcagg 
catctcgcct 
gtctatcgaa 
tgcaggaatc 
ggccgtgaac 
gaccgagtct 
tcatcgttcg 
tgggcagatc 
cctcagaatt 
ccccttggcc 
cccctcccct 
ctgctagtta 
gccaagctcc 
tcattgtttt 
tctccggggt 
gtgacccatg 



at gate gaga 
gtgtcccttg 
ategctgact 
aatggcggtt 
ggtgcagcaa 
aatgtcatca 
gctcccaaca 
gctggctaca 
tctgggaagt 
gaccagctgg 
gatccctttg 
caggtagtgg 
gagaagtcct 
ettcaggegt 

ggggagactg 

aagactggtg 
gaagaggagc 
aagtaagctg 
cgtgtcagct 
gcgccccacc 
ctggagccct 
tctcgcctca 
ggccacaggc 
ag 1812 



tggatggaac 
ccgtctgcaa 
tggctggcaa 
etcatgetgg 
acttcaggga 
aggagaaata 
tcctggagaa 
ctgataaggt 
atgacctgga 
ctgacctgta 
accaggatga 
gggatgatct 
gcaactgcct 
gcaagctggc 
aagatacctt 
ccccttgccg 
tgggcagcaa 
tgggcaggca 
caggcagctc 
gccgtggagt 
gttggcagct 
ctttccacca 
tagatccccg 



agaaaataaa 
agctggtgcc 
ctctgaagtc 
caacaagctg 
agccatgcgc 
tgggaaagat 
taaagaaggc 
ggtcategge 
cttcaagtct 
caagtccttc 
ctggggagct 
cacagtgacc 
cctgctcaaa 
ccaggccaat 
catcgctgac 
atetgagege 
ggctaagttt 
agcccttcgg 
gaggcccccg 
tcgtaccgct 
etagctttge 
agtgtctaga. 
gtggttttgt 



tctaagtttg 
gttgagaagg 
atcctgccag 
gecatgeagg 
attggagcag 
gccaccaatg 
ctggagctgc 
atggacgtag 
cccgatgacc 
atcaaggact 
tggcagaagt 
aacccaaaga 
gtcaaccaga 
ggttggggcg 
ctggttgtgg 
ttggccaagt 
geeggcagga 
tcacctgttg 
accaacactt 
tccttagaac 
agtcgtgtaa 
gtcatgtgag 
gctcaaaata 



gtgegaaege 
gggtccccct 
tcccggcgtt 
agttcatgat 
aggtttacca 
tgggggatga 
tgaagactgc 
cggcctccga 
ccagcaggta 
acccagtggt 
tcacagccag 
ggatcgccaa 
ttggctccgt 
tcatggtgtc 
ggctgtgcac 
acaaccagct 
acttcagaaa 
gctacacaga 
gcaggggtcc 
ttctacagaa 
ttggcccaag 
cctcgtgtca 
aaaagectea 



480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 



<210> 72 
<211> 60 

<212> DNA 
<213> Homo 



sapiens 



<300> 

<308> NM_001428 



<400> 72 

agctctagct tttgeagteg tgtaatgggc ccaagtcatt gtttttctcg cctcactttc 60 

<210> 73 

<211> 8368 

<212> DNA 

<213> Homo sapiens 



<300> 

<308> NM_001456 



<400> 73 

gcgatccggg cgccaccccg eggtcategg 
acctctgctc cctgcctcgc ctcccgcgcg 
cgtcccctcg ccgaggctgc agcaccgccc 
ctcccactct cgggcgggcc agagegcage 
gcgggacgcc gagatgeegg ccaccgagaa 
gatccagcag aacactttca cgcgctggtg 
catcgccaac ctgeagaegg acctgagega 

gctcagccag aagaagatgc accgcaagca 
gcttgagaac gtgtcggtgg cgctcgagtt 
catcgacagc aaggecateg tggacgggaa 
cctgatcctg cactactcca tctccatgcc 
caagaagcag acccccaagc agaggctcct 
gcccatcacc aacttcagcc gggactggca 
cagctgtgcc ccgggcctgt gtcctgactg 



tcaccggtcg ctctcaggaa cagcagcgca 60 

cctaggtgcc tgegacttta attaaagggc 120 

ccccggcttc tcgcgcctca aaatgagtag 180 

aggegegget ccgggcggcg gcgtcgacac 240 

ggacctggcg gaggacgege cgtggaagaa 3 00 

caacgagcac ctgaagtgcg tgagcaagcg 360 

egggctgegg cttatcgcgc tgttggaggt 420 

caaccagcgg cccactttcc gecaaatgea 480 

cctggaccgc gagagcatca aactggtgtc 540 

cctgaagctg atcctgggcc tcatctggac 600 

catgtgggac gaggaggagg atgaggaggc 660 

gggctggatc cagaacaagc tgccgcagct 72 0 

gageggcegg gccctgggcg ccctggtgga 780 

ggactcttgg gacgccagca agcccgttac 840 
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caatgcgcga gaggccatgc agcaggcgga tgactggctg ggcatccccc aggtgatcac 9 00 

ccccgaggag attgtggacc ccaacgtgga cgagcactct gtcatgacct acctgtccca 960 

gttccccaag gccaagctga agccaggggc tcccttgcgc cccaaactga acccgaagaa 1020 

agcccgtgcc tacgggccag gcatcgagcc cacaggcaac atggtgaaga agcgggcaga 1080 

gttcactgtg gagaccagaa gtgctggcca gggagaggtg ctggtgtacg tggaggaccc 1140 

ggccggacac caggaggagg caaaagtgac cgccaataac gacaagaacc gcaccttctc 12 00 

cgtctggtac gtccccgagg tgacggggac 1 tcataaggtt actgtgctct ttgctggcca 12 60 

gcacatcgcc aagagcccct tcgaggtgta cgtggataag tcacagggtg acgccagcaa 132 0 

agtgacagcc caaggtcccg gcctggagcc cagtggcaac atcgccaaca agaccaccta 13 80 

ctttgagatc tttacggcag gagctggcac gggcgaggtc gaggttgtga tccaggaccc 144 0 

catgggacag aagggcacgg tagagcctca gctggaggcc cggggcgaca gcacataccg 150 0 

ctgcagctac cagcccacca tggagggcgt ccacaccgtg cacgtcacgt ttgccggcgt 156 0 

gcccatccct cgcagcccct acactgtcac tgttggccaa gcctgtaacc cgagtgcctg 162 0 

ccgggcggtt ggccggggcc tccagcccaa gggtgtgcgg gtgaaggaga cagctgactt 1680 

caaggtgtac acaaagggcg ctggcagtgg ggagctgaag gtcaccgtga agggccccaa 1740 

gggagaggag cgcgtgaagc agaaggacct gggggatggc gtgtatggct tcgagtatta 180 0 

ccccatggtc cctggaacct atatcgtcac catcacgtgg ggtggtcaga acatcgggcg 186 0 

cagtcccttc gaagtgaagg tgggcaccga gtgtggcaat cagaaggtac gggcctgggg 192 0 

ccctgggctg gagggcggcg tcgttggcaa gtcagcagac tttgtggtgg aggctatcgg 1980 

ggacgacgtg ggcacgctgg gcttctcggt ggaagggcca tcgcaggcta agatcgaatg 204 0 

tgacgacaag ggcgacggct cctgtgatgt gcgctactgg ccgcaggagg ctggcgagta 210 0 

tgccgttcac gtgctgtgca acagcgaaga catccgcctc agccccttca tggctgacat 216 0 

ccgtgacgcg ccccaggact tccacccaga cagggtgaag gcacgtgggc ctggattgga 222 0 

gaagacaggt gtggccgtca acaagccagc agagttcaca gtggatgcca agcacggtgg 22 80 

caaggcccca cttcgggtcc aagtccagga caatgaaggc tgccctgtgg aggcgttggt 2340 

caaggacaac ggcaatggca cttacagctg ctcctacgtg cccaggaagc cggtgaagca 24 0 0 

cacagccatg gtgtcctggg gaggcgtcag catccccaac agccccttca gggtgaatgt 2460 

gggagctggc agccacccca acaaggtcaa agtatacggc cccggagtag ccaagacagg 252 0 

gctcaaggcc cacgagccca cctacttcac tgtggactgc gccgaggctg gccaggggga 25 80 

cgtcagcatc ggcatcaagt gtgcccctgg agtggtaggc cccgccgaag ctgacatcga 2 640 

cttcgacatc atccgcaatg acaatgacac cttcacggtc aagtacacgc cccggggggc 27 00 

tggcagctac accattatgg tcctctttgc tgaccaggcc acgcccacca gccccatccg 2760 

agtcaaggtg gagccctctc atgacgccag taaggtgaag gccgagggcc ctggcctcag 2 82 0 

tcgcactggt gtcgagcttg : gcaagcccac ccacttcaca gtaaatgcca aagctgctgg 2880 

caaaggcaag ctggacgtcc agttctcagg actcaccaag ggggatgcag tgcgagatgt 2 940 

ggacatcatc gaccaccatg acaacaccta cacagtcaag tacacgcctg tccagcaggg 3 000 

tccagtaggc gtcaatgtca cttatggagg ggatcccatc cctaagagcc ctttctcagt 3 060 

ggcagtatct ccaagcctgg acctcagcaa gatcaaggtg tctggcctgg gagagaaggt 3120 

ggacgttggc aaagaccagg agttcacagt caaatcaaag ggtgctggtg gtcaaggcaa 3180 

agtggcatcc aagattgtgg gcccctcggg tgcagcggtg ccctgcaagg tggagccagg 3 2 40 

cctgggggct gacaacagtg tggtgcgctt cctgccccgt gaggaagggc cctatgaggt 33 00 

ggaggtgacc tatgacggcg tgcccgtgcc tggcagcccc tttcctctgg aagctgtggc 33 60 

ccccaccaag cctagcaagg tgaaggcgtt tgggccgggg ctgcagggag gcagtgcggg 3420 

ctcccccgcc cgcttcacca tcgacaccaa gggcgccggc acaggtggcc tgggcctgac 34 80 

ggtggagggc ccctgtgagg cgcagctcga gtgcttggac aatggggatg gcacatgttc 3 540 

cgtgtcctac gtgcccaccg agcccgggga ctacaacatc aacatcctct tcgctgacac 3 600 

ccacatccct ggctccccat tcaaggccca cgtggttccc tgctttgacg catccaaagt 3 660 

caagtgctca ggccccgggc tggagcgggc caccgctggg gaggtgggcc aattccaagt 3 72 0 

ggactgctcg agcgcgggca gcgcggagct gaccattgag atctgctcgg aggcggggct 3 7 80 

tccggccgag gtgtacatcc aggaccacgg tgatggcacg cacaccatta cctacattcc 3 840 

cctctgcccc ggggcctaca ccgtcaccat caagtacggc ggccagcccg tgcccaactt 3 9 00 

ccccagcaag ctgcaggtgg aacctgcggt ggacacttcc ggtgtccagt gctatgggcc 39 60 

tggtattgag ggccagggtg tcttccgtga ggccaccact gagttcagtg tggacgcccg 402 0 

ggctctgaca cagaccggag ggccgcacgt caaggcccgt gtggccaacc cctcaggcaa 4080 

cctgacggag acctacgttc aggaccgtgg cgatggcatg tacaaagtgg agtacacgcc 4140 

ttacgaggag ggactgcact ccgtggacgt gacctatgac ggcagtcccg tgcccagcag 42 00 

ccccttccag gtgcccgtga ccgagggctg cgacccctcc cgggtgcgtg tccacgggcc 42 60 
aggcatccaa agtggcacca ccaacaagcc caacaagttc actgtggaga ccaggggagc 432 0 
tggcacgggc ggcctgggcc tggctgtaga gggcccctcc gaggccaaga tgtcctgcat 43 80 
ggataacaag gacggcagct gctcggtcga gtacatccct tatgaggctg gcacctacag 4440 
cctcaacgtc acctatggtg gccatcaagt gccaggcagt cctttcaagg tccctgtgca 4500 
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tgatgtgaca gatgcgtcca aggtcaagtg ctctgggccc ggcctgagcc caggcatggt 4560 

tcgtgccaac ctccctcagt ccttccaggt ggacacaagc aaggctggtg tggccccatt 462 0 

gcaggtcaaa gtgcaagggc ccaaaggcct ggtggagcca gtggacgtgg tagacaacgc 4680 

tgatggcacc cagaccgtca attatgtgcc cagccgagaa gggccctaca gcatctcagt 4740 

actgtatgga gatgaagagg taccccggag ccccttcaag gtcaaggtgc tgcctactca 4800 

tgatgccagc aaggtgaagg ccagtggccc cgggctcaac accactggcg tgcctgccag 4860 

cctgcccgtg gagttcacca tcgatgcaaa ggacgccggg gagggcctgc tggctgtcca 492 0 

gatcacggat cccgaaggca agccgaagaa gacacacatc caagacaacc atgacggcac 49 80 

gtatacagtg gcctacgtgc cagacgtgac aggtcgctac accatcctca tcaagtacgg 504 0 

tggtgacgag atccccttct ccccgtaccg cgtgcgtgcc gtgcccaccg gggacgccag 5100 

caagtgcact gtcacagtgt caatcggagg tcacgggcta ggtgctggca tcggccccac 5160 

cattcagatt ggggaggaga cggtgatcac tgtggacact aaggcggcag gcaaaggcaa 522 0 

agtgacgtgc accgtgtgca cgcctgatgg ctcagaggtg gatgtggacg tggtggagaa 52 80 

tgaggacggc actttcgaca tcttctacac ggccccccag ccgggcaaat acgtcatctg 5340 

tgtgcgcttt ggtggcgagc acgtgcccaa cagccccttc caagtgacgg ctctggctgg 5400 

ggaccagccc tcggtgcagc cccctctacg gtctcagcag ctggccccac agtacaccta 546 0 

cgcccagggc ggccagcaga cttgggcccc ggagaggccc ctggtgggtg tcaatgggct 552 0 

ggatgtgacc agcctgaggc cctttgacct tgtcatcccc ttcaccatca agaagggcga 5580 

gatcacaggg gaggttcgga tgccctcagg caaggtggcg cagcccacca tcactgacaa 564 0 

caaagacggc accgtgaccg tgcggtatgc acccagcgag gctggcctgc acgagatgga 57 00 

catccgctat gacaacatgc acatcccagg aagccccttg cagttctatg tggattacgt 5760 

caactgtggc catgtcactg cctatgggcc tggcctcacc catggagtag tgaacaagcc 582 0 

tgccaccttc accgtcaaca ccaaggatgc aggagagggg ggcctgtctc tggccattga 5880 

gggcccgtcc aaagcagaaa tcagctgcac tgacaaccag gatgggacat gcagcgtgtc 5940 

ctacctgcct gtgctgccgg gggactacag cattctagtc aagtacaatg aacagcacgt 6000 

cccaggcagc cccttcactg ctcgggtcac aggtgacgac tccatgcgta tgtcccacct 6060 

aaaggtcggc tctgctgccg acatccccat caacatctca gagacggatc tcagcctgct 612 0 

gacggccact gtggtcccgc cctcgggccg ggaggagccc tgtttgctga agcggctgcg 6180 

taatggccac gtggggattt cattcgtgcc caaggagacg ggggagcacc tggtgcatgt 6240 

gaagaaaaat ggccagcacg tggccagcag ccccatcccg gtggtgatca gccagtcgga 63 0 0 

aattggggat gccagtcgtg ttcgggtctc tggtcagggc cttcacgaag gccacacctt 6360 

tgagcctgca gagtttatca ttgatacccg cgatgcaggc tatggtgggc tcagcctgtc 6420 

cattgagggc cccagcaagg tggacatcaa cacagaggac ctggaggacg ggacgtgcag 6480 

ggtcacctac tgccccacag agccaggcaa ctacatcatc aacatcaagt ttgccgacca 6540 

gcacgtgcct ggcagcccct tctctgtgaa ggtgacaggc gagggccggg tgaaagagag 6 600 

catcacccgc aggcgtcggg ctccttcagt ggccaacgtt ggtagtcatt gtgacctcag 6660 

cctgaaaatc cctgaaatta gcatccagga tatgacagcc caggtgacca gcccatcggg 6720 

caagacccat gaggccgaga tcgtggaagg ggagaaccac acctactgca tccgctttgt 6780 

tcccgctgag atgggcacac acacagtcag cgtcaagtac aagggccagc acgtgcctgg 6840 

gagccccttc cagttcaccg tggggcccct aggggaaggg ggagcccaca aggtccgagc 69 00 

tgggggccct ggcctggaga gagctgaagc tggagtgcca gccgaattca gtatctggac 6960 

ccgggaagct ggtgctggag gcctggccat tgctgtcgag ggccccagca aggctgagat 7 020 

ctcttttgag gaccgcaagg acggctcctg tggtgtggct tatgtggtcc aggagccagg 7 080 

tgactacgaa gtctcagtca agttcaacga ggaacacatt cccgacagcc ccttcgtggt 7140 

gcctgtggct tctccgtctg gcgacgcccg ccgcctcact gtttctagcc ttcaggagtc 72 00 

agggctaaag gtcaaccagc cagcctcttt tgcagtcagc ctgaacgggg ccaagggggc 72 60 

gatcgatgcc aaggtgcaca gcccctcagg agccctggag gagtgctatg tcacagaaat 732 0 

tgaccaagat aagtatgctg tgcgcttcat ccctcgggag aatggcgttt acctgattga 73 80 

cgtcaagttc aacggtaccc acatccctgg aagccccttc aagatccgag ttggggagcc 7440 

tgggcatgga ggggacccag gcttggtgtc tgcttacgga gcaggtctgg aaggcggtgt 7500 

cacagggaac ccagctgagt tcgtcgtgaa cacgagcaat gcgggagctg gtgccctgtc 7560 

ggtgaccatt gacggcccct ccaaggtgaa gatggattgc caggagtgcc ctgagggcta 762 0 

ccgcgtcacc tataccccca tggcacctgg cagctacctc atctccatca agtacggcgg 7 680 

cccctaccac attgggggca gccccttcaa ggccaaagtc acaggccccc gtctcgtcag 774 0 

caaccacagc ctccacgaga catcatcagt gtttgtagac tctctgacca aggccacctg 7800 

tgccccccag catggggccc cgggtcctgg gcctgctgac gccagcaagg tggtggccaa 7860 

gggcctgggg ctgagcaagg cctacgtagg ccagaagagc agcttcacag tagactgcag 792 0 

caaagcaggc aacaacatgc tgctggtggg ggttcatggc ccaaggaccc cctgcgagga 7980 

gatcctggtg aagcacgtgg gcagccggct ctacagcgtg tcctacctgc tcaaggacaa 8040 

gggggagtac acactggtgg tcaaatgggg gcacgagcac atcccaggca gcccctaccg 8100 

cgttgtggtg ccctgagtct ggggcccgtg ccagccggca gcccccaagc ctgccccgct 8160 
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acccaagcag ccccgccctc ttcccctcaa ccccggccca ggccgccctg gccgcccgcc 8220 

tg?cac?gca gctgcccctg ccctgtgccg tgctgcgctc acctgcctcc ccagccagc 8280 

gctgacctct cggctttcac ttgggcagag ggagccattt ggtggcgctg cttgtcttct 8340 
ttggttctgg gaggggtgag ggatgggg 83 6 8 

<210> 74 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_0 01456 

JgacctcJcg gctttcactt gggcagaggg agccatttgg tggcgctgct tgtcttcttt 60 

<210> 75 
<211> 1642 
<212> DNA „.'- : t. < 
<213> Homo sapiens 

<300> 

<308> NM_001548 

ccagat'ctca gaggagcctg gctaagcaaa accctgcaga acggctgcct ""tacagc 60 
aaccatgagt acaaatggtg atgatcatca ggtcaaggat agtctggagc aattgagatg 120 
SactXca tgggag??at ccattgatga cgatgaaatg -tgatttag -aacagagt 180 
cttggatcag attgaattcc tagacaccaa atacagtgtg ggaatacaca acctactagc 240 
ctaKgaaa cacotgaaag gccagaatga ggaagccctg aagagcttaa -agaagctga 300 
aaJtaatg caggaagaac atgacaacca agcaaatgtg a^agtctgg tgacctgggg 36 0 
caactttgcc tggatgtatt accacatggg cagactggca gaagcccaga cttacctgga 420 
caaggtggag aacatttgca agaagctttc aaatcccttc cgctatagaa tggagtgtcc 480 
SaEtagac tgtgaggaag gatgggcctt gctgaagtgt aaaggaaaga attatgaacg 540 
aaccaaggcc tgctttgaaa aggtgcttga agtggaccct gaaaaccctg aatccagcgc 600 
SggtaScg aLtctgcct atcgcctgga tggctttaaa ttagccacaa aaaatcacaa 660 
gccattttct ttgcttcccc taaggcaggc tgtccgctta aatccagaca atggatatat 720 
?aaggttctc ct?gccctga agcttcagga tgaaggacag gaagctgaag gagaaaagta 780 
cat?gaagaa gctctagcca acatgtcctc acagacctat gtctttcgat atgcagccaa 840 
attt?accga agaaaaggct ctgtggataa agctcttgag ttattaaaaa aggccttgca 
ggaaaclccc acttctgLt tactgcatca ccagataggg ctttgctaca aggcacaaat 
gatccaaatc aaggaggcta caaaagggca gcctagaggg cagaacagag ^aaagctaga 
caaaataata agatcagcca tatttcattt tgaatctgca gtggaaaaaa agcccacatt 
tgaggtggc? catctagacc tggcaagaat gtatatagaa gcaggcaatc acagaaaagc 
tgaagagaat tttcaaaaat tgttatgcat gaaaccagtg gtagaagaaa ^aatgcaaga 1200 
catacatttc tactatggtc ggtttcagga atttcaaaag aaatctgacg aatgcaat 1260 
tatccattat ttaaaagcta taaaaataga acaggcatca ttaacaaggg ^aaaagtat 1320 
caattctttg aagaaattgg ttttaaggaa acttcggaga aaggcattag atctggaaag 1380 
cKgaacctc ct?gggttcg tctataaatt ggaaggaaat atgaatgaag ccctggagta 1440 
cta?gagcgg gccSgagac tggctgctga ctttgagaac tctgtgagac -ggtcctta 
ggcacccaga tatcagccac tttcacattt catttcattt tatgctaaca "tactaatc 
atcttttctg cttactgttt tcagaaacat tataattcac tgtaatgatg taattcttga 
ataataaatc tgacaaaata tt 1642 



900 

960 

1020 

1080 

1140 



1500 
1560 
1620 



<210> 76 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 



37 



WO 2005/086891 



PCT/US2005/007894 



<308> NM_001548 



<400> 76 

gtatcaattc tttgaagaaa ttggttttaa ggaaacttcg gagaaaggca ttagatctgg 6 0 

<210> 77 
<211> 3344 
<212> DNA 

<213> Homo sapiens 



<300> 

<308> NM_001605 



<400> 77 
ggtacagctg 
aggagctggg 
ctctaacagc 
cgtatgttca 
caggcatgaa 
caaagctgag 
acctggacga 
ggtcttttgg 
aagagtttgg 
ctggcttaga 
ccaaaatcct 
gtggtccttg 
tcaaccagga 
gggaagctga 
tggaacgact 
tcccttactt 
gtgctgagga 
ccatcactgt 
tgagacggat 
gcttctttgc 
tgaagaagga 
agactctcag 
agaccattcc 
tgactggact 
agaggaaact 
tgctggacat 
ccccaaagta 
ctacggtgat 
agtgtggagt 
acgaaggcta 
agaatgctca 
aagtggggga 
accacacagc 
agaaaggctc 
tgtccaccca 
aggccgtcta 
ctgtgtttga 
ccgagttgct 
ggggaacgca 
ccattgccaa 
tcaggaaagc 
agactgctcc 
ctgcagtcat 
aggtcatgga 



cgcgtctgcg 
gacggcgacc 
aagtgaaatc 
ctcgtctgcc 
ccagtttaaa 
cagagctgcc 
tgtgggcaag 
agattacttt 
cattcccatt 
agcagatctg 
cccaggcaac 
cagtgagatc 
cgaccctaat 
tggcattctg 
ggtatctgtg 
tgaagccatt 
tgccgatggg 
ggcactggct 
tctccgccga 
tacgttagtg 
cccagacatg 
cagagggcgt 
cggagacact 
gattgctgaa 
ggcccagctg 
ttacgctatc 
caattaccat 
ggctctgcgc 
ggtgctggac 
cctggtgaag 
ggtccgagga 
tcaggtctgg 
tacgcacatt 
attggttgct 
acagatcaag 
tacccaggat 
tgagacctat 
ggatgacccc 
cctgcggaac 
gggtatccgg 
agagagcttg 
aaacaaggat 
cccccagtgg 
tgacttggac 



ggaataggtg 
ctaggagagt 
cggcagcgat 
accatcccat 
cccattttcc 
aatacccaga 
gatgtctatc 
aaggaattgg 
gaaagacttt 
gaatgcaaac 
atgaaggata 
cactacgacc 
gtgctggaga 
aaacctcttc 
ctgcagaata 
cagaagggca 
attgacatgg 
gatggtggcc 
gctgtccgat 
gatgttgtcg 
gtgaaggaca 
cgcatcctgg 
gcttggctcc 
gagaagggcc 
aaatcacagg 
gaagagctcc 
ttggactcca 
agggagaaga 
aagacctgtt 
gtggatgaca 
gggtatgtgc 
ctgtttattg 
ctgaacttcg 
cctgaccgcc 
aaggctgaag 
tgccccctgg 
cctgaccctg 
tctgggcctg 
tcgagtcatg 
aggattgtgg 
aagaaatgtc 
gtgcagaggg 
cagaaggatg 
cgagccagca 



cagcgggccc 
tctttggggt 
ttatagattt 
tggatgaccc 
tgaacacaat 
agtgcatccg 
atcacacctt 
catgtaagat 
atgttactta 
agatctggca 
acttctggga 
ggattggtgg 
tctggaacct 
ccaagaaaag 
agatgtccaa 
caggtgcccg 
cctaccgggt 
ggcctgacaa 
acgcccatga 
tccagtccct 
tcattaatga 
acaggaaaat 
tctatgacac 
tggtggtaga 
gcaagggagc 
gggcacgggg 
gtggtagcta 
tgttcgtgga 
tctatgctga 
gcagtgaaga 
tacacattgg 
atgagccccg 
ccctgcgctc 
tcagatttga 
agattgctaa 
cagcagcgaa 
tgcgagtcgt 
ctggctccct 
caggagcttt 
ctgtcacagg 
tctctgtcat 
agatcgctga 
aattgcggga 
aagccgatgt 



ttggcggggg 
gactttcaag 
cttcaagagg 
cactttgctc 
tgacccatct 
ggctgggggc 
cttcgagatg 
ggctctggaa 
ctttggcggg 
aaatttgggg 
gatgggtgac 
tcgggacgcc 
tgtgttcatc 
cattgacaca 
ctatgacact 
accatacact 
gctggctgac 
cacagggcgt 
aaagctcaat 
gggagatgca 
agaagaggtg 
tcagagcctg 
ctatgggttt 
catggatggc 
tggtggggaa 
tctggaggtc 
tgtatttgag 
agaggtgtcc 
gcaaggaggc 
taaaacagag 
aaccatctac 
acgaagaccc 
agtgcttggg 
ctttactgcc 
tgagatgatt 
agccatccag 
ctccattggg 
gacttctgtt 
tgtgatcgtg 
tgccgaggcc 
ggaagccaaa 
ccttggagag 
gactctcaaa 
ccagaaacga 



actctgaggg 
atggactcta 
aacgagcata 
tttgccaatg 
caccccatgg 
aaacaaaatg 
ctgggctctt 
ctcctcaccc 
gatgaagcag 
ctggatgaca 
acgggcccct 
gcacatcttg 
cagtataaca 
gggatgggcc 
gacctttttg 
gggaaagttg 
catgctcgga 
ggatatgtgt 
gccagcaggg 
tttcctgagc 
cagtttctca 
ggagacagca 
ccagtggatc 
tttgaagagg 
gacctcatta 
acagatgatt 
aacacagtgg 
acaggccagg 
cagatctatg 
tttacagtga 
ggtgacctga 
atcatgagca 
gaagctgacc 
aagggagcca 
gaggcagcca 
ggcctacggg 
gtcccggtgt 
gagttctgtg 
acggaagaag 
cagaaggccc 
gtgaaggctc 
gccctggcca 
tccctaaaga 
gtgttagaga 



agacgaagca gttcatcgac agcaacccca accagcctct tgtcatcctg gagatggaga 
gcggcgcctc agccaaggcc ctgaatgaag ccttgaagct cttcaagatg cactcccctc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 
2760 
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agacttctgc 
aagtccccca 
caggcttgat 
acgttggctg 
gggatgtaaa 
agagctcfctc 
acccagcagt 
ctgccctgag 
tgatcgtcac 
aagtaggaat 



catgctcttc 
gaatgcagcc 
ggacggtaaa 
cctgcaggag 
gaactgagtg 
atctgctaca 
aactggaaca 
ccctccacgt 
gctcgcatct 
cgtttttgct 



acggtggaca 
aatcggggct 
ggtggtggca 
gcgctgcagc 
gggaaggagg 
agaacatttg 
cacfctgggag 
cagtgccatc 
atagataacg 
gcagagaata 



atgaggctgg 
taaaagccag 
aggatgtgtc 
tggccacttc 
aggctcccac 
aatcttggga 
cagtcctatg 
ggtctagaac 
gctctccaga 
aaaggaccac 



caagatcacg 
cgagtgggtg 
tgcacaggcc 
cttcgcccag 
tggatccatc 
cctttaaaga 
tctcagtgcc 
cactaacccc 
cctgagcttt 
gtgc 3344 



tgcctgtgtc 2 82 0 

cagcaggtgt 2 880 

acaggcaaga 2940 

ctgcgcctcg 3 00 0 

cgtccagcca 306 0 

gcccctccta 312 0 

ccttaaattt 3180 

gcattgctgt 32 40 

ccgcgtcagc 33 00 



<210> 78 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_001605 



gccaagagct cttcatctgc tacaagaaca tttgaatctt gggaccttta aagagcccct 60 



60 

120 

180 

240 

300 



<210> 79 
<211> 417 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> NM_001645 

acctcccaac caagccctcc agcaaggatt caggagtgcc cctcgggcct cgccatgagg 
ctcttcctgt cgctcccggt cctggtggtg gttctgtcga tcgtcttgga aggcccagcc 
ccagcccagg ggaccccaga cgtctccagt gccttggata agctgaagga gtttggaaac 
acactggagg acaaggctcg ggaactcatc agccgcatca aacagagtga actttctgcc 
aagatgcggg agtggttttc agagacattt cagaaagtga aggagaaact caagattgac 
tcatgaggac ctgaagggtg acatccagga ggggcctctg aaatttccca caccccagcg 360 
cctgtgctga ggactcccgc catgtggccc caggtgccac caataaaaat cctaccg 417 

<210> 80 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_001645 

aaacagagtg aactttctgc caagatgcgg gagtggtttt cagagacatt tcagaaagtg 6 0 

<210> 81 

<211> 1389 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_001809 



cgcggacttc tgccaagcac cggctcatgt gaggctcgcg gcacagcgtt ctctgggctc 



60 
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cccagaagcc agcctttcgc tcccggaccc ggcagcccga gcaggagccg tgggaccggg 12 0 

cgccagcacc ctctgcggcg tgtcatgggc ccgcgccgcc ggagccgaaa gcccgaggcc 180 

ccgaggaggc gcagcccgag cccgaccccg acccccggcc cctcccggcg gggcccctcc 240 

ttaggcgctt cctcccatca acacagtcgg cggagacaag gttggctaaa ggagatccga 300 

aagcttcaga agagcacaca cctcttgata aggaagctgc ccttcagccg cctggcaaga 360 

gaaatatgtg ttaaattcac tcgtggtgtg gacttcaatt ggcaagccca ggccctattg 420 

gccctacaag aggcagcaga agcatttcta gttcatctct ttgaggacgc ctatctcctc 480 

accttacatg caggccgagt tactctcttc ccaaaggatg tgcaactggc ccggaggatc 54 0 

cggggccttg aggagggact cggctgagct cctgcaccca gtgtttctgt cagtctttcc 600 

tgctcagcca ggggggatga taccggggac tctccagagc catgactaga tccaatggat 66 0 

tctgcgatgc tgtctggact ttgctgtctc tgaacagtat gtgtgtgttg ctttaaatat 72 0 

ttttcttttt tttgagaagg agaagactgc atgactttcc tctgtaacag aggtaatata 780 

tgagacaatc aacaccgttc caaaggcctg aaaataattt tcagataaag agactccaag 840 

gttgacttta gtttgtgagt tactcatgtg actatttgag gattttgaaa acatcagatt 900 

tgctgtggta tgggagaaaa ggttatgtac ttattatttt agctctttct gtaatattta 960 

cattttttac catatgtaca tttgtacttt tattttacac ataagggaaa aaataagacc 102 0 

actttgagca gttgcctgga aggctgggca tttccatcat atagacctct gcccttcaga 1080 

gtagcctcac cattagtggc agcatcatgt aactgagtgg actgtgcttg tcaacggatg 1140 

tgtagctttt cagaaactta attggggatg aatagaaaac ctgtaagctt tgatgttctg 12 00 

gttacttcta gtaaattcct gtcaaaatca attcagaaat tctaacttgg agaatttaac 12 60 

attttactct tgtaaatcat agaagafcgta tcataacagt tcagaatttt aaagtacatt 132 0 

ttcgatgctt ttatgggtat ttttgtagtt tctttgtaga gagataataa aaatcaaaat 1380 
atttaatga 1389 



<210> 82 
<211> 60 
<212> DNA 



<213> Homo sapiens 



<300> 

<308> NM_001809 
<400> 82 

ggggatgaat agaaaacctg taagctttga 



tgttctggtt acttctagta aattcctgtc 60 



<210> 83 

<211> 2205 

<212> DNA 

<213> Homo sapiens 



<300> 

<308> NM_001909 



<400> 83 

gcgcacgccg gccgcgccca cgtgaccggt 
ggcccaactg cggcgtcatc ccggctataa 
ggccgccgcc gccatgcagc cctccagcct 
acccgcctcc gcgctcgtca ggatcccgct 
gtcggaggtt gggggctctg tggaggacct 
ccaggcggtg ccagccgtga ccgaggggcc 
cgcccagtac tacggggaga ttggcatcgg 
cgacacgggc tcctccaacc tgtgggtccc 
ttgctggatc caccacaagt acaacagcga 
ctcgtttgac atccactatg gctcgggcag 
gtcggtgccc tgccagtcag cgtcgtcagc 
gcaggtcttt ggggaggcca ccaagcagcc 
tggcatcctg ggcatggcct acccccgcat 
caacctgatg cagcagaagc tggtggacca 
cccagatgcg cagcctgggg gtgagctgat 



ccgggtgcaa acacgcgggt cagctgatcc 6 0 

gcgcacggcc tcggcgaccc tctccgaccc 12 0 

tctgccgctc gccctctgcc tgctggctgc 180 

gcacaagttc acgtccatcc gccggaccat 240 

gattgccaaa ggccccgtct caaagtactc 3 00 

cattcccgag gtgctcaaga actacatgga 3 60 

gacgcccccc cagtgcttca cagtcgtctt 42 0 

ctccatccac tgcaaactgc tggacatcgc 480 

caagtccagc acctacgtga agaatggtac 540 

cctctccggg tacctgagcc aggacactgt 600 

ctctgccctg ggcggtgtca aagtggagag 660 

aggcatcacc ttcatcgcag ccaagttcga 720 

ctccgtcaac aacgtgctgc ccgtcttcga 7 80 

gaacatcttc tccttctacc tgagcaggga 840 

gctgggtggc acagactcca agtattacaa 900 



40 



WO 2005/086891 



PCT/US2005/007894 



gggttctctg 
ggtggaggtg 
aggcacttcc 
ggccgtgccg 
cgcgatcaca 
caaggtgtcg 
gccacccagc 
gtttgaccgt 
gcgtccgcgc 
cggcccctcc 
cggcggccca 
atgctgcctg 
agatctgttt 
ggggctccct 

gcccctaccc 
gggcccctct 
gggcccgttg 
acaaaaccca 
atgtattggc 
gacctctgtt 
ttggcctccg 
aaaaaaaaaa 



tcctacctga 
gccagcgggc 
ctcatggtgg 
ctgattcagg 
ctgaagctgg 
caggccggga 
gggccactct 
gacaacaaca 
gccagcacag 
cacacacacc 
agcccgactt 
cctgtctgtc 
cgtgcattgg 
tcatctccag 

cacaccaggc 
gcccagcttg 
aggatgaggc 
ccttgttgga 
ctggaggtgg 
gtcctcccct 
gcctcccctc 
aaaaaaaaaa 



atgtcacccg 
tgaccctgtg 
gcccggtgga 
gcgagtacat 
gaggcaaagg 
agaccctctg 
ggatcctggg 
gggtgggctt 
aaacagagga 
cacacactcg 
gctgttttgt 
tctccatctg 
aagaccccac 
ggagtcccct 

cgtcctcccg 
ggcccagctg 
cgctagaggc 
gcctgcaggg 
ggttgggatt 
tgggcggctg 
aaaaaaaaaa 
aaaaaaaaaa 



caaggcctac 
caaggagggc 
tgaggtgcgc 
gatcccctgt 
ctacaagctg 
cctgagcggc 
cgacgtcttc 
cgccgaggct 
gagtcccaga 
cccgcccact 
tctgtggttt 
tttggtgggg 
ccaagcttgg 
ccccggccct 

ggccctccct 
ggctctgcca 
ctgaggatga 
tggtgctggg 
gggggctggt 
agagccccag 
aaaaaaaaaa 
aaaaaaaaaa 



tggcaggtcc 
tgtgaggcca 
gagctgcaga 
gagaaggtgt 
tccccagagg 
ttcatgggca 
atcggccgct 
gcccgcctct 
gcaggaggcc 
gtcctgggcg 
tcccctccct 
gtagagctga 
c age c gage t 
accagcgccc 



acctggacca 
ttgtggacac 
aggecategg 
ccaccctgcc 
actacacgct 
tggacatccc 
actacactgt 
agttcccaag 
cctggcccag 
ccctggaagc 
gggttcagaa 
tccagagcac 
cgtgtatcct 
gctgggctga 



tggaaacctg 
ccctacctgt 
gctggaagga 
actgagecag 
gccagccttc 
ctgacatgga 
aaaaaaaaaa 
aaaaa 2205 



ccctgcctga 
tcagtgtccc 
gtgagagggg 
tcccaggggc 
ctctgcagct 
aatacagttg 
aaaaaaaaaa 



960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 
1860 
1920 
1980 
2040 
2100 
2160 



<210> 84 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_001909 
<400> 84 

tctgtttggt gggggtagag ctgatccaga gcacagatct gtttcgtgca ttggaagacc 60 

<210> 85 

<211> 817 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_002038 



<400> 85 

gaaccgttta 

cttctctcct 

eggtatcget 

agaaaaagtg 

tggcegtegg 

gcatcgcggc 

gcggcgtgcc 

gegtegtcat 

gtgaggagga 

tcttccagtt 

gttctcacta 

actgcagcct 

tacaagcatg 

cctagatgtg 

<210> 86 
<211> 60 



ctcgctgctg 
ccaaggtcta 
tttcttgtgc 
cteggagage 
aggaggactc 
caacteggtg 
cgccgggggg 
aggtaatatt 
tgaggagtag 
aggatctaga 
tattgtccag 
ccaactccta 
cgccgacgat 
aaaacagaat 



tgcccatcta tcagcaggct ccgggctgaa gattgettet 60 

gtgaeggage ccgcgcgcgg cgccaccatg eggcagaagg 12 0 

tacctgctgc tcttcacttg cagtggggtg gaggcaggta 180 

teggacageg getcegggtt ctggaaggcc ctgaccttca 240 

gcagtcgccg ggctgcccgc gctgggcttc accggcgccg 3 00 

gctgcctcgc tgatgagctg gtctgegate ctgaatgggg 3 60 

ctagtggcca egctgeagag cctcggggct ggtggcagca 420 

ggtgccctga tgggctaege cacccacaag tatctcgata 480 

ccagcagctc ccagaacctc ttcttccttc ttggcctaac 540 

actttgeett tttttttttt tttttttttt tttgagatgg 600 

gctagagtgc agtggctatt cacagatgeg aacatagtac 660 

gectcaagtg atcctcctgt ctcaacctcc caagtaggat 720 

gcccagaatc cagaactttg tctatcactc tccccaacaa 780 
aaacttcacc cagaaaa 817 
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<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_002038 
<400> 86 

agctcccaga acctcttctt ccttcttggc ctaactcttc cagttaggat ctagaacttt 60 

<210> 87 

<211> 1283 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_002046 
<400> 87 

ctctctgctc ctcctgttcg acagtcagcc gcatcttctt ttgcgtcgcc agccgagcca 60 

catcgctcag acaccatggg gaaggtgaag gtcggagtca acggatttgg tcgtattggg 120 

cgcctggtca ccagggctgc ttttaactct ggtaaagtgg atattgttgc catcaatgac 180 

cccttcattg acctcaacta catggtttac atgttccaat atgattccac ccatggcaaa 240 

ttccatggca ccgtcaaggc tgagaacggg aagcttgtca tcaatggaaa tcccatcacc 300 

atcttccagg agcgagatcc ctccaaaatc aagtggggcg atgctggcgc tgagtacgtc 360 

gtggagtcca ctggcgtctt caccaccatg gagaaggctg gggctcattt gcagggggga 42 0 

gccaaaaggg tcatcatctc tgccccctct gctgatgccc ccatgttcgt catgggtgtg 480 

aaccatgaga agtatgacaa cagcctcaag atcatcagca atgcctcctg caccaccaac 540 

tgcttagcac ccctggccaa ggtcatccat gacaactttg gtatcgtgga aggactcatg 600 

accacagtcc atgccatcac tgccacccag aagactgtgg atggcccctc cgggaaactg 660 

tggcgtgatg gccgcggggc tctccagaac atcatccctg cctctactgg cgctgccaag 72 0 

gctgtgggca aggtcatccc tgagctgaac gggaagctca ctggcatggc cttccgtgtc 7 80 

cccactgcca acgtgtcagt ggtggacctg acctgccgtc tagaaaaacc tgccaaatat 840 

gatgacatca agaaggtggt gaagcaggcg tcggagggcc ccctcaaggg catcctgggc 9 00 

tacactgagc accaggtggt ctcctctgac ttcaacagcg acacccactc ctccaccttt 960 

gacgctgggg ctggcattgc cctcaacgac cactttgtca agctcatttc ctggtatgac 1020 

aacgaatttg gctacagcaa cagggtggtg gacctcatgg cccacatggc ctccaaggag 10 80 

taagacccct ggaccaccag ccccagcaag agcacaagag gaagagagag accctcactg 1140 

ctggggagtc cctgccacac tcagtccccc accacactga atctcccctc ctcacagttg 12 00 

ccatgtagac cccttgaaga ggggaggggc ctagggagcc gcaccttgtc atgtaccatc 12 60 
aataaagtac cctgtgctca acc 1283 

<210> 88 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_002046 
<400> 88 

ctcaacgacc actttgtcaa gctcatttcc tggtatgaca acgaatttgg ctacagcaac 60 

<210> 89 

<211> 1610 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_002061 
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60 



<400> 89 

ggcacgaggc tgcggccgca gtagccggag ccggagccgc agccaccggt gccttccttt 

cccgccgccg cccagccgcc gtccggcctc cctcgggccc gagcgcagac caggctccag 120 

ccgcgcggcg ccggcagcct cgcgctccct ctcgggtctc tctcgggcct cgggcaccgc 180 

gtcctgtggg cggccgcctg cctgcccgcc cgcccgcagc cccttgcctg ccggcccctg 240 

ggcggcccgt gccatgggca ccgacagccg cgcggccaag gcgctcctgg cgcgggcccg 300 

caccctgcac ctgcagacgg ggaacctgct gaactggggc cgcctgcgga agaagtgccc 360 

gtccacgcac agcgaggagc ttcatgattg tatccaaaaa accttgaatg aatggagttc 42 0 

ccaaatcaac ccagatttgg tcagggagtt tccagatgtc ttggaatgca ctgtatctca 48 0 

tgcagtagaa aagataaatc ctgatgaaag agaagaaatg aaagtttctg caaaactgtt 540 

cattgtagaa tcaaactctt catcatcaac tagaagtgca gttgacatgg cctgttcagt 600 

ccttggagtt gcacagctgg attctgtgat cattgcttca cctcctattg aagatggagt 66 0 

taatctttcc ttggagcatt tacagcctta ctgggaggaa ttagaaaact tagttcagag 72 0 

caaaaagatt gttgccatag gtacctctga tctagacaaa acacagttgg aacagctgta 7 80 

tcagtgggca caggtaaaac caaatagtaa ccaagttaat cttgcctcct gctgtgtgat 840 

gccaccagat ttgactgcat ttgctaaaca atttgacata cagctgttga ctcacaatga 9 00 

tccaaaagaa ctgctttctg aagcaagttt ccaagaagct cttcaggaaa gcattcctga 960 

cattcaagcg cacgagtggg tgccgctgtg gctactgcgg tattcggtca ttgtgaaaag 1020 

tagaggaatt atcaaatcaa aaggctacat tttacaagct aaaagaaggg gttcttaact 1080 

gacttaggag cataacttac ctgtaatttc cttcaatatg agagaaaatt gagatgtgta 1140 

aaatctagtt actgcctgta aatggtgtca ttgaggcaga tattctttcg tcatatttga 12 00 

cagtatgttg tctgtcaagt tttaaatact tatcttgcct ccatatcaat ccattctcat 12 60 

gaacctctgt attgctttcc ttaaactatt gttttctaat tgaaattgtc tataaagaaa 1320 

atacttgcaa tatatttttc ctttattttt atgactaata taaatcaaga aaatttgttg 1380 

ttagatatat tttggcctag gtatcagggt aatgtatata catatttttt atttccaaaa 1440 
aaaattcatt aattgcttct taactcttat tataaccaag caatttaatt acaattgtta 150 0 
aaactgaaat actggaagaa gatatttttc ctgtcattga tgagatatat cagagtaact 1560 
ggagtagctg ggatttacta gtagtgtaaa taaaattcac tcttcaatac 1610 



60 



<210> 90 
<211> 60 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> NM_002061 
<400> 90 

ctgacttagg agcataactt acctgtaatt tccttcaata tgagagaaaa ttgagatgtg 

<210> 91 

<211> 873 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_002106 
<400> 91 

cgcagtttga atcgcggtgc gacgaaggag taggtggtgg gatctcaccg tgggtccgat 
tagccttttc tctgccttgc ttgcttgagc ttcagcggaa ttcgaaatgg ctggcggtaa 
ggctggaaag gactccggaa aggccaagac aaaggcggtt tcccgctcgc agagagccgg 180 
cttgcagttc ccagtgggcc gtattcatcg acacctaaaa tctaggacga ccagtcatgg 240 
acgtgtgggc gcgactgccg ctgtgtacag cgcagccatc ctggagtacc tcaccgcaga 3 00 
ggtacttgaa ctggcaggaa atgcatcaaa agacttaaag gtaaagcgta ttacccctcg 3 60 
tcacttgcaa cttgctattc gtggagatga agaattggat tctctcatca aggctacaat 420 
tgctggtggt ggtgtcattc cacacatcca caaatctctg attgggaaga aaggacaaca 480 
gaagactgtc taaaggatgc ctggattcct tgttatctca ggactctaaa tactctaaca 540 
gctgtccagt gttggtgatt ccagtggact gtatctctgt gaaaaacaca attttgcctt 600 
tttgtaattc tatttgagca agttggaagt ttaattagct ttccaaccaa ccaaatttct 660 



60 

120 
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gcattcgagt cttaaccata tttaagtgtt actgtggctt caaagaagct attgattctg 7 20 

aagtagtggg ttttgattga gttgactgtt tttaaaaaac tgtttggatt ttaattgtga 7 80 

tgcagaagtt atagtaacaa acatttggtt ttgtacagac attatttcca ctctggtgga 840 
taagttcaat aaaggtcata tcccaaacta aaa 873 

<210> 92 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_002106 
<400> 92 

cgagtcttaa ccatatttaa gtgttactgt ggcttcaaag aagctattga ttctgaagta 60 

<210> 93 

<211> 4204 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_002205 
<400> 93 

caggacaggg aagagcgggc gctatgggga gccggacgcc agagtcccct ctccacgccg 
tgcagctgcg ctggggcccc cggcgccgac ccccgctcgt gccgctgctg ttgctgctcg 
tgccgccgcc acccagggtc gggggcttca acttagacgc ggaggcccca gcagtactct 
cggggccccc gggctccttc ttcggattct cagtggagtt ttaccggccg ggaacagacg 
gggtcagtgt gctggtggga gcacccaagg ctaataccag ccagccagga gtgctgcagg 
gtggtgctgt ctacctctgt ccttggggtg ccagccccac acagtgcacc cccattgaat 
ttgacagcaa aggctctcgg ctcctggagt cctcactgtc cagctcagag ggagaggagc 
ctgtggagta caagtccttg cagtggttcg gggcaacagt tcgagcccat ggctcctcca 
tcttggcatg cgctccactg tacagctggc gcacagagaa ggagccactg agcgaccccg 
tgggcacctg ctacctctcc acagataact tcacccgaat tctggagtat gcaccctgcc 
gctcagattt cagctgggca gcaggacagg gttactgcca aggaggcttc agtgccgagt 
tcaccaagac tggccgtgtg gttttaggtg gaccaggaag ctatttctgg caaggccaga 
tcctgtctgc cactcaggag cagattgcag aatcttatta ccccgagtac ctgatcaacc 
tggttcaggg gcagctgcag actcgccagg ccagttccat ctatgatgac agctacctag 
gatactctgt ggctgttggt gaattcagtg gtgatgacac agaagacttt gttgctggtg 
tgcccaaagg gaacctcact tacggctatg tcaccatcct taatggctca gacattcgat 
ccctctacaa cttctcaggg gaacagatgg cctcctactt tggctatgca gtggccgcca 
cagacgtcaa tggggacggg ctggatgact tgctggtggg ggcacccctg ctcatggatc 
ggacccctga cgggcggcct caggaggtgg gcagggtcta cgtctacctg cagcacccag 
ccggcataga gcccacgccc acccttaccc tcactggcca tgatgagttt ggccgatttg 
gcagctcctt gacccccctg ggggacctgg accaggatgg ctacaatgat gtggccatcg 
gggctccctt tggtggggag acccagcagg gagtagtgtt tgtatttcct gggggcccag 
gagggctggg ctctaagcct tcccaggttc tgcagcccct gtgggcagcc agccacaccc 
cagacttctt tggctctgcc cttcgaggag gccgagacct ggatggcaat ggatatcctg 
atctgattgt ggggtccttt ggtgtggaca aggctgtggt atacaggggc cgccccatcg 
tgtccgctag tgcctccctc accatcttcc ccgccatgtt caacccagag gagcggagct 
gcagcttaga ggggaaccct gtggcctgca tcaaccttag cttctgcctc aatgcttctg 
gaaaacacgt tgctgactcc attggtttca cagtggaact tcagctggac tggcagaagc 
agaagggagg ggtacggcgg gcactgttcc tggcctccag gcaggcaacc ctgacccaga 
ccctgctcat ccagaatggg gctcgagagg attgcagaga gatgaagatc tacctcagga 
acgagtcaga atttcgagac aaactctcgc cgattcacat cgctctcaac ttctccttgg 
acccccaagc cccagtggac agccacggcc tcaggccagc cctacattat cagagcaaga 
gccggataga ggacaaggct cagatcttgc tggactgtgg agaagacaac atctgtgtgc 
ctgacctgca gctggaagtg tttggggagc agaaccatgt gtacctgggt gacaagaatg 
ccctgaacct cactttccat gcccagaatg tgggtgaggg tggcgcctat gaggctgagc 
ttcgggtcac cgcccctcca gaggctgagt actcaggact cgtcagacac ccagggaact 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
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2700 
2760 
2820 



tctccagcct gagctgtgac tactttgccg tgaaccagag ccgcctgctg gtgtgtgacc 222 0 

tgggcaaccc catgaaggca ggagccagtc tgtggggtgg ccttcggttt acagtccctc 2280 

atctccggga cactaagaaa accatccagt ttgacttcca gatcctcagc aagaatctca 2340 

acaactcgca aagcgacgtg gtttcctttc ggctctccgt ggaggctcag gcccaggtca 2400 

ccctgaacgg tgtctccaag cctgaggcag tgctattccc agtaagcgac tggcatcccc 2460 

gagaccagcc tcagaaggag gaggacctgg gacctgctgt ccaccatgtc tatgagctca 252 0 

tcaaccaagg ccccagctcc attagccagg gtgtgctgga actcagctgt ccccaggctc 2580 

tggaaggtca gcagctccta tatgtgacca gagttacggg actcaactgc accaccaatc 2 64 0 
accccattaa cccaaagggc ctggagttgg atcccgaggg ttccctgcac caccagcaaa 
aacgggaagc tccaagccgc agctctgctt cctcgggacc tcagatcctg aaatgcccgg 
aggctgagtg tttcaggctg cgctgtgagc tcgggcccct gcaccaacaa gagagccaaa 

gtctgcagtt gcatttccga gtctgggcca agactttctt gcagcgggag caccagccat 2880 

ttagcctgca gtgtgaggct gtgtacaaag ccctgaagat gccctaccga atcctgcctc 2940 

ggcagctgcc ccaaaaagag cgtcaggtgg ccacagctgt gcaatggacc aaggcagaag 3000 

gcagctatgg cgtcccactg tggatcatca tcctagccat cctgtttggc ctcctgctcc 3060 

taggtctact catctacatc ctctacaagc ttggattctt caaacgctcc ctcccatatg 312 0 

gcaccgccat ggaaaaagct cagctcaagc ctccagccac ctctgatgcc tgagtcctcc 3180 

caatttcaga ctcccattcc tgaagaacca gtccccccac cctcattcta ctgaaaagga 3240 

ggggtctggg tacttcttga aggtgctgac ggccagggag aagctcctct ccccagccca 3300 

gagacatact tgaagggcca gagccagggg ggtgaggagc tggggatccc tcccccccat 33 60 

gcactgtgaa ggacccttgt ttacacatac cctcttcatg gatgggggaa ctcagatcca 3420 

gggacagagg cccagcctcc ctgaagcctt tgcattttgg agagtttcct gaaacaactg 3480 

gaaagataac taggaaatcc attcacagtt ctttgggcca gacatgccac aaggacttcc 3 540 

tgtccagctc caacctgcaa agatctgtcc tcagccttgc cagagatcca aaagaagccc 3 600 

ccagtaagaa cctggaactt ggggagttaa gacctggcag ctctggacag ccccaccctg 3 660 

gtgggccaac aaagaacact aactatgcat ggtgccccag gaccagctca ggacagatgc 3720 

cacaaggata gatgctggcc cagggccaga gcccagctcc aaggggaatc agaactcaaa 37 80 

tggggccaga tccagcctgg ggtctggagt tgatctggaa cccagactca gacattggca 3 840 

ccaatccagg cagatccagg actatatttg ggcctgctcc agacctgatc ctggaggccc 3 900 

agttcaccct gatttaggag aagccaggaa tttcccagga cctgaagggg ccatgatggc 3 96 0 

aacagatctg gaacctcagc ctggccagac acaggccctc cctgttcccc agagaaaggg 

gagcccactg tcctgggcct gcagaatttg ggttctgcct gccagctgca ctgatgctgc 

ccctcatctc tctgcccaac ccttccctca ccttggcacc agacacccag gacttattta 

aactctgttg caagtgcaat aaatctgacc cagtgccccc actgaccaga actagaaaaa 42 00 
aaaa 4204 

<210> 94 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_002205 
<400> 94 

ttggcaccag acacccagga cttatttaaa ctctgttgca agtgcaataa atctgaccca 

<210> 95 
<211> 1976 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> NM_002266 
<400> 95 

gccacacggt ctttgagctg agtcgaggtg gaccctttga acgcagtcgc cctacagccg 



4020 
4080 
4140 



60 



60 



ctgattcccc ccgcatcgcc tcccgtggaa gcccaggccc gcttcgcagc tttctccctt 120 
tgtctcataa ccatgtccac caacgagaat gctaatacac cagctgcccg tcttcacaga 180 
ttcaagaaca agggaaaaga cagtacagaa atgaggcgtc gcagaataga ggtcaatgtg 240 
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gagctgagga aagctaagaa ggatgaccag atgctgaaga ggagaaatgt aagctcattt 300 

cctgatgatg ctacttctcc gctgcaggaa aaccgcaaca accagggcac tgtaaattgg 3 60 

tctgttgatg acattgtcaa aggcataaat agcagcaatg tggaaaatca gctccaagct 420 

actcaagctg ccaggaaact actttccaga gaaaaacagc cccccataga caacataatc 480 

cgggctggtt tgattccgaa atttgtgtcc ttcttgggca gaactgattg tagtcccatt 540 

cagtttgaat ctgcttgggc actcactaac attgcttctg ggacatcaga acaaaccaag 600 

gctgtggtag atggaggtgc catcccagca t teat tt etc tgttggcatc tccccatgct 660 

cacatcagtg aacaagctgt ctgggctcta ggaaacattg caggtgatgg ctcagtgttc 72 0 

cgagacttgg ttattaagta cggtgcagtt gacccactgt tggctctcct tgcagttcct 7 80 

gatatgtcat ctttagcatg tggctactta cgtaatctta cctggacact ttctaatctt 840 

tgccgcaaca agaatcctgc acccccgata gatgctgttg agcagattct tcctacctta 900 

gttcggctcc tgcatcatga tgatccagaa gtgttagcag atacctgetg ggctatttcc 960 

taccttactg atggtccaaa tgaacgaatt ggcatggtgg tgaaaacagg agttgtgccc 102 0 

caacttgtga agcttctagg agcttctgaa ttgccaattg tgactcctgc cctaagagcc 1080 

atagggaata ttgtcactgg tacagatgaa cagactcagg ttgtgattga tgeaggagea 1140 

ctcgccgtct ttcccagcct gctcaccaac cccaaaacta acattcagaa ggaagctacg 1200 

tggacaatgt caaacatcac agccggccgc caggaccaga tacagcaagt tgtgaatcat 126 0 

ggattagtcc cattccttgt cagtgttctc tctaaggcag attttaagac acaaaaggaa 132 0 

gctgtgtggg ccgtgaccaa ctataccagt ggtggaacag ttgaacagat tgtgtacctt 1380 

gttcactgtg gcataataga accgttgatg aacctcttaa ctgeaaaaga taccaagatt 144 0 

attctggtta tcctggatgc catttcaaat atctttcagg ctgctgagaa actaggtgaa 1500 

actgagaaac ttagtataat gattgaagaa tgtggaggct tagacaaaat tgaagctcta 15 60 

caaaaccatg aaaatgagtc tgtgtataag gcttcgttaa gcttaattga gaagtatttc 162 0 

tctgtagagg aagaggaaga teaaaaegtt gtaccagaaa ctacctctga aggctacact 1680 

ttccaagttc aggatggggc tcctgggacc tttaactttt agatcatgta gctgagacat 1740 

aaatttgttg tgtactacgt ttggtatttt gtcttattgt ttctctacta agaactcttt 1800 

cttaaatgtg gtttgttact gtagcacttt ttacactgaa actatacttg aacagttcca 1860 

actgtacata catactgtat gaagcttgtc ctctgactag gtttctaatt tctatgtgga 192 0 
atttcctatc ttgeagcate ctgtaaataa acattcaagt ccacccttaa aaaaaa 19 7 6 

<210> 96 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NMJD02266 
<400> 96 

tgagtctgtg tataaggctt cgttaagctt aattgagaag tatttctctg tagaggaaga 60 

<210> 97 

<211> 1145 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_002346 
<400> 97 

gctccggcca gccgcggtcc agagegegeg aggttcgggg agctccgcca ggctgctggt 60 

acctgcgtcc gcccggcgag caggacaggc tgctttggtt tgtgacctcc aggcaggacg 12 0 

gccatcctct ccagaatgaa gatcttcttg ccagtgctgc tggctgccct tctgggtgtg 180 

gagegageca getegctgat gtgcttctcc tgettgaace agaagagcaa tctgtactgc 240 

ctgaagcega ccatctgctc cgaccaggac aactactgeg tgactgtgtc tgctagtgcc 300 

ggcattggga atetegtgae atttggecac agectgagea agacctgttc cccggcctgc 360 

cccatcccag aaggcgtcaa tgttggtgtg gcttccatgg gcatcagctg ctgccagagc 420 

tttctgtgca atttcagtgc ggccgatggc gggctgcggg caagcgtcac cctgctgggt 480 

gccgggctgc tgetgagect gctgccggcc ctgctgcggt ttggcccctg accgcccaga 540 

ccctgtcccc cgatccccca gctcaggaag gaaagcccag ccctttctgg atcccacagt 600 

gtatgggagc ccctgactcc tcacgtgcct gatctgtgcc cttggtccca ggtcaggccc 660 
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accccctgca cctccacctg ccccagcccc tgcctctgcc caagtgggcc agctgccctc 720 

acttctgggg tggatgatgt gaccttcctt gggggactgc ggaagggacg agggttccct 780 

ggagtcttac ggtccaacat cagaccaagt cccatggaca tgctgacagg gtccccaggg 840 

agaccgtgtc agtagggatg tgtgcctggc tgtgtacgtg ggtgtgcagt gcacgtgaga 900 

gcacgtggcg gcttctgggg gccatgtttg gggagggagg tgtgccagca gcctggagag 960 

cctcagtccc tgtagccccc tgccctggca cagctgcatg cacttcaagg gcagcctttg 102 0 

ggggttgggg tttctgccac ttccgggtct aggccctgcc caaatccagc cagtcctgcc 1080 

ccagcccacc cccacattgg agccctcctg ctgctttggt gcctcaaata aatacagatg 114 0 
tcccc 1145 

<210> 98 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_0023 46 
<400> 98 

ggttccctgg agtcttacgg tccaacatca gaccaagtcc catggacatg ctgacagggt 6 0 

<210> 99 

<211> 1390 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_002358 
<400> 99 

gggaagtgct gttggagccg ctgtggttgc tgtccgcgga gtggaagcgc gtgcttttgt 60 

ttgtgtccct ggccatggcg ctgcagctct cccgggagca gggaatcacc ctgcgcggga 12 0 

gcgccgaaat cgtggccgag ttcttctcat tcggcatcaa cagcatttta tatcagcgtg 180 

gcatatatcc atctgaaacc tttactcgag tgcagaaata cggactcacc ttgcttgtaa 240 

ctactgatct tgagctcata aaatacctaa ataatgtggt ggaacaactg aaagattggt 3 00 

tatacaagtg ttcagttcag aaactggttg tagttatctc aaatattgaa agtggtgagg 3 60 

tcctggaaag atggcagttt gatattgagt gtgacaagac tgcaaaagat gacagtgcac 42 0 

ccagagaaaa gtctcagaaa gctatccagg atgaaatccg ttcagtgatc agacagatca 480 

cagctacggt gacatttctg ccactgttgg aagtttcttg ttcatttgat ctgctgattt 540 

atacagacaa agatttggtt gtacctgaaa aatgggaaga gtcgggacca cagtttatta 60 0 

ccaattctga ggaagtccgc cttcgttcat ttactactac aatccacaaa gtaaatagca 66 0 

tggtggccta caaaattcct gtcaatgact gaggatgaca tgaggaaaat aatgtaattg 72 0 

taattttgaa atgtggtttt cctgaaatca ggtcatctat agttgatatg ttttatttca 7 80 

ttggttaatt tttacatgga gaaaaccaaa atgatactta ctgaactgtg tgtaattgtt 840 

cctttatttt tttggtacct atttgactta ccatggagtt aacatcatga atttattgca 9 00 

cattgttcaa aaggaaccag gaggtttttt tgtcaacatt gtgatgtata ttcctttgaa 9 60 

gatagtaact gtagatggaa aaacttgtgc tataaagcta gatgctttcc taaatcagat 1020 

gttttggtca agtagtttga ctcagtatag gtagggagat atttaagtat aaaatacaac 1080 

aaaggaagtc taaatattca gaatctttgt taaggtcctg aaagtaactc ataatctata 1140 

aacaatgaaa tattgctgta tagctccttt tgaccttcat ttcatgtata gttttcccta 1200 

ttgaatcagt ttccaattat ttgactttaa tttatgtaac ttgaacctat gaagcaatgg 1260 

atatttgtac tgtttaatgt tctgtgatac agaactctta aaaatgtttt ttcatgtgtt 132 0 

ttataaaatc aagttttaag tgaaagtgag gaaataaagt taagtttgtt ttaaaaaaaa 1380 
aaaaaaaaaa 13 90 

<210> 100 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 
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<308> NM_002358 
<400> 100 

atgctttcct aaatcagatg ttttggtcaa gtagtttgac tcagtatagg tagggagata 60 

<210> 101 

<211> 1821 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM__002422 
<400> 101 

acaaggaggc aggcaagaca gcaaggcata gagacaacat agagctaagt aaagccagtg 60 

gaaatgaaga gtcttccaat cctactgttg ctgtgcgtgg cagtttgctc agcctatcca 12 0 

ttggatggag ctgcaagggg tgaggacacc agcatgaacc ttgttcagaa atatctagaa 180 

aactactacg acctcaaaaa agatgtgaaa cagtttgtta ggagaaagga cagtggtcct 240 

gttgttaaaa aaatccgaga aatgcagaag ttccttggat tggaggtgac ggggaagctg 3 00 

gactccgaca ctctggaggt gatgcgcaag cccaggtgtg gagttcctga tgttggtcac 3 60 

ttcagaacct ttcctggcat cccgaagtgg aggaaaaccc accttacata caggattgtg 42 0 

aattatacac cagatttgcc aaaagatgct gttgattctg ctgttgagaa agctctgaaa 480 

gtctgggaag aggtgactcc actcacattc tccaggctgt atgaaggaga ggctgatata 5 40 

atgatctctt ttgcagttag agaacatgga gacttttacc cttttgatgg acctggaaat 600 

gttttggccc atgcctatgc ccctgggcca gggattaatg gagatgccca ctttgatgat 660 

gatgaacaat ggacaaagga tacaacaggg accaatttat ttctcgttgc tgctcatgaa 720 

attggccact ccctgggtct ctttcactca gccaacactg aagctttgat gtacccactc 7 80 

tatcactcac tcacagacct gactcggttc cgcctgtctc aagatgatat aaatggcatt 840 

cagtccctct atggacctcc ccctgactcc cctgagaccc ccctggtacc cacggaacct 900 

gtccctccag aacctgggac gccagccaac tgtgatcctg ctttgtcctt tgatgctgtc 9 60 

agcactctga ggggagaaat cctgatcttt aaagacaggc acttttggcg caaatccctc 102 0 

aggaagcttg aacctgaatt gcatttgatc tcttcatttt ggccatctct tccttcaggc 1080 

gtggatgccg catatgaagt tactagcaag gacctcgttt tcatttttaa aggaaatcaa 114 0 

ttctgggcca tcagaggaaa tgaggtacga gctggatacc caagaggcat ccacacccta 12 0 0 

ggtttccctc caaccgtgag gaaaatcgat gcagccattt ctgataagga aaagaacaaa 12 60 

acatatttct ttgtagagga caaatactgg agatttgatg agaagagaaa ttccatggag 132 0 

ccaggctttc ccaagcaaat agctgaagac tttccaggga ttgactcaaa gattgatgct 13 80 

gtttttgaag aatttgggtt cttttatttc tttactggat cttcacagtt ggagtttgac 1440 

ccaaatgcaa agaaagtgac acacactttg aagagtaaca gctggcttaa ttgttgaaag 15 0 0 

agatatgtag aaggcacaat atgggcactt taaatgaagc taataattct tcacctaagt 156 0 

ctctgtgaat tgaaatgttc gttttctcct gcctgtgctg tgactcgagt cacactcaag 162 0 

ggaacttgag cgtgaatctg tatcttgccg gtcattttta tgttattaca gggcattcaa 16 8 0 

atgggctgct gcttagcttg caccttgtca catagagtga tctttcccaa gagaagggga 1740 
agcactcgtg tgcaacagac aagtgactgt atctgtgtag actatttgct tatttaataa 180 0 
agacgatttg tcagttgttt t 1821 

<210> 102 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_002422 
<400> 102 

tgtagaaggc acaatatggg cactttaaat gaagctaata attcttcacc taagtctctg 60 

<210> 103 
<211> 2787 
<212> DNA 



48 



WO 2005/086891 



PCT/US2005/007894 



<213> Homo sapiens 
<300> 

<308> NM„002462 



<400> 103 

agagcggagg 

ggaatcccag 

cagctcaggg 

gctacaggaa 

attccaagga 

tatttgaagg 

gtggacatcg 

gctactgtgg 

gaggagaagg 

caggacctgg 

gtgttggagg 

ccgctggtgc 

taccaggact 

gcccagaatg 

atcagctccc 

gctgtgggca 

atccagaggc 

acagaggctc 

ttgacgaagc 

aacctcgtgt 

atccaggacc 

cacccatatt 

aaacttacca 

atcaaggaga 

gaagacgaaa 

atcactgctc 

agactccgac 

cataaaattt 

ccaggctttg 

gaagagccgg 

gatgtttcga 

attgaagaca 

cagatggaac 

agagagaagg 

tccagctcgg 

caggaggcca 

cagacgtacg 

tacagctggc 

cggcttgcac 

tctgtccagc 

gactgacgac 

ccgtggtgat 

ccaccctcag 

ggattttcag 

aactgacaca 

cccagtgttt 

tctctgtaat 



ccgcactcca 

tgtcacggtg 

gctttggaat 

gactcccact 

ggtgcaggag 

aacgtatatt 

caaaagctga 

cccagaaaaa 

tgcgcccctg 

ccctgccagc 

cactgtcagg 

tgaaactgaa 

acgagattga 

ccatcgccgg 

gagatgtccc 

atcagcctgc 

aggagacaat 

tcagcatggc 

ctgatctggt 

tccacctgaa 

agctgagcct 

tcagggatct 

gcgagctcat 

ctcaccagag 

atgaaaaaat 

tcatgcaagg 

acgagttcca 

tgagtagaaa 

tgaattacag 

ctgtggatat 

taaaaaattt 

ttagagcaga 

agattgtcta 

agctggaaga 

caacagactc 

gcaagcgcat 

gccagcagct 

tcctgaagga 

ggctgacgca 

cccgtagacg 

ttgagtgctc 

ttagcaggaa 

atgcacatga 

ccctcagaat 

tgctgaacat 

taggagcatg 

aaactcattt 



gcactgcgca gggaccgcct 
gacacgcctc cctcgcgccc 
tctgtggcca cactgcgagg 
ccctgaaatc tggagtgaag 
aacagctctg tgataccatt 
agagcttact ttgcaaagaa 
tccagctgct gcatcccacc 
tccaggctcg gtggctgaga 
catcgacctc attgactccc 
catcgccgtc atcggggacc 
agttgccctt cccagaggca 
gaaacttgtg aacgaagata 
gatttcggat gcttcagagg 
ggaaggaatg ggaatcagtc 
ggatctgact ctaatagacc 
tgacattggg tataagatca 
cagcctggtg gtggtcccca 
ccaggaggtg gaccccgagg 
ggacaaagga actgaagaca 
gaagggttac atgattgtca 
gtccgaagcc ctgcagagag 
gctggaggaa ggaaaggcca 
cacacatatc tgtaaatctc 
aataacagag gagctacaaa 
gttcttcctg atagataaaa 
agaggaaact gtaggggagg 
caaatggagt acaataattg 
aatccagaaa tttgaaaatc 
gacatttgag acaatcgtga 
gctacacacc gtgacggata 
tgaagagttt tttaacctcc 
acaagagaga gaaggtgaga 
ctgccaggac caggtataca 
agaaaagaag aagaaatcct 
ttccatggag gagatctttc 
ctccagccac atccctttga 
tcagaaggcc atgctgcagc 
gcggagcgac accagcgaca 
ggctcggcgc cggcttgccc 
tgcacgcaca ctgtctgccc 
agtagtcaga ctggatagtc 
gctgtgagag cagtttggtt 
gctggcggga ttgaaggatg 
cgctccacct tgcagctctc 
cacagcttat ttcctcattt 
agtgccgtgt gtgtgcgtcc 
ctagcag 2787 



tggaccgcag 

ttgccgccca 

agatcggttc 

aacgccgcca 

taacttgttg 

ggaagatggt 

ctctattact 

acaacctgtg 

tgcgggctct 

agagctcggg 

gcgggatcgt 

agtggagagg 

tagaaaagga 

atgagctaat 

ttcctggcat 

agacactcat 

gtaatgtgga 

gagacaggac 

aggttgtgga 

agtgccgggg 

agaagatctt 

cggttccctg 

tgcccctgtt 

agtatggtgt 

ttaatgcctt 

aagacattcg 

aaaacaattt 

agtatcgtgg 

aacagcaaat 

tggtccggct 

acagaaccgc 

agctgatccg 

ggggtgcatt 

gggattttgg 

agcacctgat 

tcatccagtt 

tcctgcagga 

agcggaagtt 

agttccccgg 

ccgttcccgg 

cgtctctgct 

tctagcatga 

ctgtcttcgt 

cccttctctg 

ttataatgtc 

tgtcggagcc 



ttgccggcca 

cctgctcacc 

tgggtcggag 

tccagccacc 

acattacttt 

tgtttccgaa 

gaatggagat 

cagccagtat 

aggtgtggag 

caagagctcc 

gaccagatgc 

caaggtcagt 

aattaataaa 

caccctggag 

aaccagagtg 

caagaagtac 

catcgccacc 

catcggaatc 

cgtggtgcgg 

ccagcaggag 

ctttgagaac 

cctggcagaa 

agaaaatcaa 

cgacataccg 

taatcaggac 

gctgtttacc 

tcaagaaggc 

tagagagctg 

caaggcactg 

tgctttcaca 

caagtccaaa 

cctccacttc 

gcagaaggtc 

ggctttccag 

ggcctatcac 

cttcatgctc 

caaggacacc 

cctgaaggag 

ttaaccacac 

gtagccactg 

tatccgttag 

agacagagcc 

actgggaaag 

tattcctaga 

ccttcacaaa 

ctgtctcctc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 



<210> 104 

<211> 60 

<212> DNA 

<213> Homo sapiens 



<300> 

<308> NM_002462 
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<400> 104 

actgacacat gctgaacatc acagcttatt tcctcatttt tataatgtcc cttcacaaac 6 0 

<210> 105 

<211> 2808 

<212> DNA 

<213> Homo sapiens 

<3 00> 

<308> NM_002759 



<400> 105 
gcggcggcgg 
tccacacttg 
gaggcatcga 
ttcgctggta 
ctcttttacc 
acctcaaaac 
aatcaacatc 
atacgggaag 
acataccgtc 
ccacatgata 
gaaggtagat 
aataaggaaa 
ttatccatgg 
gtaaattatg 
aaaatgggac 
ttggccgcta 
tacctgtcct 
agcacactcg 
aattctaaca 
caaaggaagg 
aagtatactg 
ggtggatttg 
aaacgtgtta 
gatcatgtaa 
accagtgatg 
aggtcaaaga 
caatggattg 
gaacaaataa 
aagccaagta 
gtaacatctc 
agcccagaac 
ctaattcttg 
acagacctac 
cagaaattac 
ttgactgtgt 
gaaaaagtat 
atatcaatag 
aatctttctg 
tactttttcc 
tgttgcccag 
gttcaagtga 
acccaactaa 
ctcaaactcc 
gggatgagcc 
aattttcttt 
cgcttttagg 
ggagaaatat 



cggcgcagtt 
gtagaaccac 
ggtccatccc 
tactcgctgt 
agtttctgga 
tttagcagtt 
cacacttccg 
aagaaatggc 
agaagcaggg 
ggaggtttac 
caaagaagga 
agaaggcagt 
ggaattacat 
aacagtgtgc 
agaaagaata 
aacttgcata 
ctggttcttt 
cttctgaatc 
gtgacagttt 
caaaaagatc 
tggacaagag 
gccaagtttt 
aatataataa 
atattgttca 
attctcttga 
ctaagtgcct 
aaaaaagaag 
caaaaggggt 
atatattctt 
tgaaaaatga 
agatttcttc 
ctgaacttct 
gggatggcat 
tctcaaagaa 
ggaagaaaag 
cctgcttctg 
atatttacct 
cagaaacaga 
tggctcatct 
gctggagtgc 
ttctcctgcc 
tttttgtgtt 
tgacctcaag 
accgcgccca 
ttatactatt 
ccaaaaaaat 
ggtactcatt 



tgctcatact ttgtgacttg cggtcacagt ggcattcagc 60 

agcatagaaa catcctaaac aatcttcatc 12 0 

aggagaccct ggctatcata gaccttagtc 180 

gcggttgact ttttttaagc cttctttttt 240 

tttgccttcc tggatttgta aattgtaatg 3 00 

actcaggttt gcttctctgg cggtcttcag 3 60 

cgtgcatttt ggacaaagct tccaaccagg 42 0 

tcagcaggtt tcttcatgga ggaacttaat 4 80 

aaatatcaag aactgcctaa ttcaggacct 540 

ataatagatg gaagagaatt tccagaaggt 600 

gccgcagcca aattagctgt tgagatactt 6 60 

ttattgacaa caacgaattc ttcagaagga 720 

aatagaattg cccagaagaa aagactaact 7 80 

catgggccag aaggatttca ttataaatgc 840 

acaggttcta ctaaacagga agcaaaacaa 9 00 

ttatcagaag aaacctcagt gaaatctgac 9 60 

tgtgagtccc aaagcaactc tttagtgacc 102 0 

ggtgacttct cagcagatac atcagagata 1080 

tcgttgctta tgaatggtct cagaaataat 1140 

agatttgacc ttcctgacat gaaagaaaca 120 0 

gattttaaag aaatagaatt aattggctca 12 6 0 

cacagaattg acggaaagac ttacgttatt 132 0 

gagcgtgaag taaaagcatt ggcaaaactt 1380 

tgttgggatg gatttgatta tgatcctgag 144 0 

tatgatcctg agaacagcaa aaatagttca 15 00 

atggaattct gtgataaagg gaccttggaa 1560 

ctagacaaag ttttggcttt ggaactcttt 162 0 

cattcaaaaa aattaattca tagagatctt 1680 

aaacaagtaa agattggaga ctttggactt 1740 

acaaggagta agggaacttt gcgatacatg 180 0 

ggaaaggaag tggacctcta cgctttgggg 186 0 

gacactgctt ttgaaacatc aaagtttttc 192 0 

atatttgata aaaaagaaaa aactcttcta 19 80 

cgacctaaca catctgaaat actaaggacc 2 040 

aatgaacgac acacatgtta gagcccttct 2100 

ttccttaaat tatctaaaat ctgctaggga 2160 

tgtttccttt aattttttac tatttttact 2220 

tctttttgct tcaaaaacat tcttacattt 2280 

tttttttttt ttaaagacag agtctcgctc 23 40 

tcttggctca ctgcaacttc tgcctcttgg 2400 
gagtagctgg attacaggca tgtgccaccc 2 460 
acagggtttc accatgttgg ccaggctggt 2520 
gcctcggcct cccaaagtgc tgggattaca 2 580 
tttgttctaa agatggaaaa accaccccca 2 640 
tcaattcata tctatttatt aaatttctac 2700 
tctctgcctc acatagctta caagccagct 2760 
aaaaagtgat gtacaacc 2 808 



aggcacgaca 
aataaaaatc 
ctgtcaacca 
gcaaattcag 
cttccatctg 
tgattatctg 
tggtgatctt 
agtagtactt 
atttcaagtt 
agcaaaaaat 
tagtccttta 
aggccttatc 
atcgggggtg 
tagtattggt 
tcttcagata 
tgctactacg 
atcatctgaa 
aaacagttct 
tttggcaccc 
gtttggcatg 
caaagcaaaa 
cgagaaggcg 
ctacaatggc 
gagcagtgat 
tttcatccaa 
aggcgagaaa 
ggattatata 
agtagataca 
tggaaagcga 
gcaagactat 
tcatgtatgt 
catctcagat 
acctgaggat 
cccagagaaa 
atatgcagtt 
tttattttaa 
aaggttttct 
ctttattctt 
aatgacacag 
tcagcctcct 
tttaataaag 
taatccacct 
gcctcatctc 
aatgaatcaa 
gtaagatcgt 
aaaaaaaaaa 
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<210> 106 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_002759 
<400> 106 

tcgttctctg cctcacatag cttacaagcc agctggagaa atatggtact cattaaaaaa 60 

<210> 107 

<211> 1678 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_002811 
<400> 107 

aagaaggagg ccgcgcgagg gctgacgaac cggaagaaga ggaactgggc ctgaaagggt 60 

accggtgacc gctactgctg ccggtgtttg cgtgtggcag ggagccaggc ctggcgagcg 12 0 

gggtgtgtcg cgatgccgga gctggcagtg cagaaggtgg tggtccaccc cctggtgctg 180 

ctcagtgtgg tggatcattt caaccgaatc ggcaaggttg gaaaccagaa gcgtgttgtt 240 

ggtgtgcttt tggggtcatg gcaaaagaaa gtacttgatg tatcgaacag ttttgcagtt 3 00 

ccttttgatg aagatgacaa agacgattct gtatggtttt tagaccatga ttatttggaa 360 

aacatgtatg gaatgtttaa gaaagtcaat gccagggaaa gaatagttgg ctggtaccac 42 0 

acaggcccta aactacacaa gaatgacatt gccatcaacg aactcatgaa aagatactgt 480 

cctaattccg tattggtcat cattgatgtg aagccgaagg acctagggct gcctacagaa 540 

gcgtacattt cagtggaaga agtccatgat gatggaactc caacctcgaa aacatttgaa 600 

cacgtgacca gtgaaattgg agcagaggaa gctgaggaag ttggagttga acacttgtta 6 60 

cgagatatca aagacacgac ggtgggcact ctgtcccagc ggatcacaaa ccaggtccat 72 0 

ggtttgaagg gactgaactc caagcttctg gatatcagga gctacctgga aaaagtcgcc 7 80 

acaggcaagc tgcccatcaa ccaccagatc atctaccagc tgcaggacgt cttcaacctg 840 

ctgccagatg tcagcctgca ggagttcgtc aaggcctttt acctgaagac caatgaccag 9 00 

atggtggtag tgtacttggc ctcgctgatc cgttccgtgg tcgccctgca caacctcatc 960 

aacaacaaga ttgccaaccg ggatgcagag aagaaagaag ggcaggagaa agaagagagc 102 0 

aaaaaggata ggaaagagga caaggagaaa gataaagata aggaaaagag tgatgtaaag 10 80 

aaagaggaga aaaaggagaa aaagtaaaac atgtattaaa tagctttttt aatttgtaaa 1140 

ttaaaatctt acaaactaaa tcagtgtgct gctagagggt tctttttcac ttgacatgct 12 00 

tattagaaag ctgacccaac aagagctctc tgcctccggt cactcttgct gtggtgctac 12 60 

gtggaagtga atggagactg atctcaaatc tgaactgcag ctttcgctgc tgtgagttgg 132 0 

gg'atatgata gtcagctcag gcttcagatt gtatgagaaa aatgaagaga agtcaacaaa 13 80 

tattttggta ctcttcattc atttatctct aaaaccagga gttgaatttt cctcatcttg 1440 

aaagactctt ggggtctgtt tctggtattt tacaaaattg ctaagtggaa tgcatgaatt 1500 

gcattatgfct ctctggtaac acgtagagtt cagacccttc tgaactctgt tgataatacc 15 60 

acaccatgtt ctggacccat agctctggca tcctcagggg ttgtgatcca gctccatata 162 0 
ttgtttacct tcaaagatac aattaaatgg cttgattttt aaaaaaaaaa aaaaaaaa 167 8 

<210> 108 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_002811 
<400> 108 

aaattgctaa gtggaatgca tgaattgcat tatgttctct ggtaacacgt agagttcaga 60 
<210> 109 
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<2H> 846 
<212> DNA 
<213> Homo sapiens 



<300> 

<308> NM_002888 
<400> 109 

ccacgtccgg ggtgccgagc caactttcct gcgtccatgc agccccgccg gcaacggctg 60 

cccgctccct ggtccgggcc caggggcccg cgccccaccg ccccgctgct cgcgctgctg 12 0 

ctgttgctcg ccccggtggc ggcgcccgcg gggtccgggg gccccgacga ccctgggcag 180 

cctcaggatg ctggggtccc gcgcaggctc ctgcagcaga aggcgcgcgc ggcgcttcac 2 40 

ttcttcaact tccggtccgg ctcgcccagc gcgctgcgag tgctggccga ggtgcaggag 3 00 

ggccgcgcgt ggattaatcc aaaagaggga tgtaaagttc acgtggtctt cagcacagag 3 60 

cgctacaacc cagagtcttt acttcaggaa ggtgagggac gtttggggaa atgttctgct 42 0 

cgagtgtttt tcaagaatca gaaacccaga ccaaccatca atgtaacttg tacacggctc 480 

atcgagaaaa agaaaagaca acaagaggat tacctgcttt acaagcaaat gaagcaactg 540 

aaaaacccct tggaaatagt cagcatacct gataatcatg gacatattga tccctctctg 6 00 

agactcatct gggatttggc tttccttgga agctcttacg tgatgtggga aatgacaaca 6 60 

caggtgtcac actactactt ggcacagctc actagtgtga ggcagtgggt aagaaaaacc 720 

tgaaaattaa cttgtgccac aagagttaca atcaaagtgg tctccttaga ctgaattcat 7 80 

gtgaacttct aatttcatat caagagttgt aatcacattt atttcaataa atatgtgagt 840 
tcctgc 846 

<210> 110 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_002888 
<400> 110 

aaagaaaaga caacaagagg attacctgct ttacaagcaa atgaagcaac tgaaaaaccc 60 

<210> 111 

<211> 1054 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_003090 
<400> 111 

gaattccgcg ggaggccacg ggctttccac agcgcggggg aacgggaggc tgcaggatgg 6 0 
tcaagctgac ggcggagctg atcgagcagg cggcgcagta caccaacgcg gtgcgcgacc 12 0 
gggagctgga cctccggggg tataaaattc ccgtcattga aaatctaggt gctacgttag 180 

accagtttga tgctattgat ttttctgaca atgagatcag gaaactggat ggttttcctt 240 

tgttgagaag actgaaaaca ttgttagtga acaacaacag aatatgccgt ataggtgagg 3 00 

gacttgatca ggctctgccc tgtctgacag aactcattct caccaataat agtctcgtgg 3 60 

aactgggtga tctggaccct ctggcatctc tcaaatcgct gacttaccta agtatcctaa 42 0 

gaaatccggt aaccaataag aagcattaca gattgtatgt gatttataaa gttccgcaag 480 

tcagagtact ggatttccag aaagtgaaac taaaagagcg tcaggaagca gagaaaatgt 540 

tcaagggcaa acggggtgca cagcttgcaa aggatattgc caggagaagc aaaactttta 600 

atccaggtgc tggtttgcca actgacaaaa agagaggtgg gccatctcca ggggatgtag 660 

aagcaatcaa gaatgccata gcaaatgctt caactctggc tgaagtggag aggctgaagg 720 

ggttgctgca gtctggtcag atccctggca gagaacgcag atcagggccc actgatgatg 780 

gtgaagaaga gatggaagaa gacacagtca caaacgggtc ctgagcagtg aggcagatgt 840 

ataataatag gccctcttgg aacaagtctt gcttttcgaa catggtataa tagccttgtt 900 
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tgtgttagca aagtggaatc tatcagcatt gttgaaatgc ttaagactgc tgctgataat 960 

tttgtaatat aagttttgaa atctaaatgt caattttcta caaattataa aaataaactc 102 0 
cactctctat gctaaaaaaa aaaaaaagga attc 1054 

<210> 112 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_003090 
<400> 112 

taatagcctt gtttgtgtta gcaaagtgga atctatcagc attgttgaaa tgcttaagac 6 0 

<210> 113 

<211> 2033 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_003158 
<400> 113 

gaattccggg actgagctct tgaagacttg ggtccttggt cgcaggtgga gcgacgggtc 60 

tcactccatt gcccaggcca gagtgcggga tatttgataa gaaacttcag tgaaggccgg 12 0 

gcgcggtgct catgcccgta atcccagcat tttcggaggc cgaggcatca tggaccgatc 180 

taaagaaaac tgcatttcag gacctgttaa ggctacagct ccagttggag gtccaaaacg 240 

tgttctcgtg actcagcaat ttccttgtca gaatccatta cctgtaaata gtggccaggc 3 00 

tcagcgggtc ttgtgtcctt caaattcttc ccagcgcgtt cctttgcaag cacaaaagct 360 

tgtctccagt cacaagccgg ttcagaatca gaagcagaag caattgcagg caaccagtgt 420 

acctcat'cct gtctccaggc cactgaataa cacccaaaag agcaagcagc ccctgccatc 480 

gcacctgaaa ataatcctga ggaggaactg gcatcaaaac agaaaaatga agaatcaaaa 540 

agaggcagtg gctttggaag actttgaaat tggtcgccct ctgggtaaag gaaagtttgg 600 

taatgtttat ttggcaagag aaaagcaaag caagtttatt ctggctctta aagtgttatt 660 

taaagctcag ctggagaaag ccggagtgga gcatcagctc agaagagaag tagaaataca 72 0 

gtcccacctt cggcatccta atattcttag actgtatggt tatttccatg atgctaccag 7 80 

agtctaccta attctggaat atgcaccact tggaacagtt tatagagaac ttcagaaact 840 

ttcaaagttt gatgagcaga gaactgctaa cttatataac agaattgcaa atgccctgtc 900 

ttactgtcat tcgaagagag ttattcatag agacattaag ccagagaact tacttcttgg 960 

atcagctgga gagcttaaaa ttgcagattt tgggtggtca gtacatgctc catcttccag 102 0 

gaggaccact ctctgtggca ccctggacta cctgccccct gaaatgattg aaggtcggat 1080 

gcatgatgag aaggtggatc tctggagcct tggagttctt tgctatgaat ttttagttgg 1140 

gaagcctcct tttgaggcaa acacatacca agagacctac aaaagaatat cacgggttga 12 00 

attcacattc cctgactttg taacagaggg agccagggac ctcatttcaa gactgttgaa 12 6 0 

gcataatccc agccagaggc caatgctcag agaagtactt gaacacccct ggatcacagc 132 0 

aaattcatca aaaccatcaa attgccaaaa caaagaatca gctagcaaac agtcttagga 13 80 

atcgtgcagg gggagaaatc cttgagccag ggctgccata taacctgaca ggaacatgct 1440 

actgaagttt attttaccat tgactgctgc cctcaatcta gaacgctaca caagaaatat 1500 

tttgttttta ctcagcaggt gtgccttaac ctccctattc agaaagctcc acatcaataa 1560 

acatgacact ctgaagtgaa agtagccacg agaattgtgc tacttatact ggaacataat 1620 

ctggaggcaa ggttcgactg cagtcgaacc ttgcctccag attatgaacc agtataagta 1680 

gcacaattct cgtggctact ttcacttcag agtgtcatgt ttattgatgt ggagctttct 1740 

gaatagggag gttaaggcac acctgctgag taaaacaaat atttcttgtg tagcgttctt 1800 

aggaatctgg tgtctgtccg gccccggtag gcctgttggg tttctagtcc tccttaccat 1860 

catctccata tgagagtgtg aaaataggaa cacgtgctct acctccattt agggatttgc 192 0 

ttgggataca gaagaggcca tgtgtctcag agctgttaag ggcttatttt tttaaaacat 1980 
tggagtcata gcatgtgtgt aaactttaaa tatgcaggcc ttcgtggctc gag 2033 

<210> 114 
<211> 60 
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<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_003158 
<400> 114 

ttgggtttct agtcctcctt accatcatct ccatatgaga gtgtgaaaat aggaacacgt 60 

<210> 115 

<211> 1421 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_003258 
<400> 115 

acttactgcg ggacggcctt ggagagtact cgggttcgtg aacttcccgg aggcgcaatg 60 

agctgcatta acctgcccac tgtgctgccc ggctccccca gcaagacccg ggggcagatc 12 0 

caggtgattc tcgggccgat gttctcagga aaaagcacag agttgatgag acgcgtccgt 180 

cgcttccaga ttgctcagta caagtgcctg gtgatcaagt atgccaaaga cactcgctac 240 

agcagcagct tctgcacaca tgaccggaac accatggagg cgctgcccgc ctgcctgctc 3 00 

cgagacgtgg cccaggaggc cctgggcgtg gctgtcatag gcatcgacga ggggcagttt 360 

ttccctgaca tcatggagtt ctgcgaggcc atggccaacg ccgggaagac cgtaattgtg 42 0 

gctgcactgg atgggacctt ccagaggaag ccatttgggg ccatcctgaa cctggtgccg 480 

ctggccgaga gcgtggtgaa gctgacggcg gtgtgcatgg agtgcttccg ggaagccgcc 540 

tataccaaga ggctcggcac agagaaggag gtcgaggtga ttgggggagc agacaagtac 600 

cactccgtgt gtcggctctg ctacttcaag aaggcctcag gccagcctgc cgggccggac 660 

aacaaagaga actgcccagt gccaggaaag ccaggggaag ccgtggctgc caggaagctc 72 0 

tttgccccac agcagattct gcaatgcagc cctgccaact gagggacctg caagggccgc 7 80 

ccgctccctt cctgccactg ccgcctactg gacgctgccc tgcatgctgc ccagccactc 840 

caggaggaag tcgggaggcg tggagggtga ccacaccttg gccttctggg aactctcctt 9 00 

tgtgtggctg ccccacctgc cgcatgctcc ctcctctcct acccactggt ctgcttaaag 9 60 

cttccctctc agctgctggg acgatcgccc aggctggagc tggccccgct tggtggcctg 1020 

ggatctggca cactccctct ccttggggtg agggacagag ccccacgctg ttgacatcag 1080 

cctgcttctt cccctctgcg gctttcactg ctgagtttct gttctccctg ggaagcctgt 1140 

gccagcacct ttgagccttg gcccacactg aggcttaggc ctctctgcct gggatgggct 12 00 

cccaccctcc cctgaggatg gcctggattc acgccctctt gtttcctttt gggctcaaag 12 60 

cccttcctac ctctggtgat ggtttccaca ggaacaacag catctttcac caagatgggt 132 0 

ggcaccaacc ttgctgggac ttggatccca ggggcttatc tcttcaagtg tggagagggc 13 80 
agggtccacg cctctgctgt agcttatgaa attaactaat t 1421 

<210> 116 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM„003258 
<400> 116 

cttcctacct ctggtgatgg tttccacagg aacaacagca tctttcacca agatgggtgg 60 

<210> 117 
<211> 913 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> NM_003311 
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<400> 117 

agagccggcg ccgtcaccgc ccgcattgcc 
aatcccccga cgaggtgcta cgcgagggcg 
agctatggaa gaagaagcgc ggggtgctca 
gcccccgcgc gcgccccaag gagctgcgct 
agcgcacggg caagtacgtg tacttcacca 
tccgctgcgc gggcgagagc tgctggaacg 
agaaccgccg cgccctgcag gactttcgca 
ccgccgagga cgccgtggct gccgcggccg 
ggccatcccc gcagcccaaa ccccgcacgc 

gagtcggacc gaggctagga cgtggccggc 
cgtgcgcgcg gatcctcgct ccgttgcacg 
ctatgtattt atttcgctgg ttctttgtag 
tttgcatcac tgtgcccatt gcaaataaat 
gctgtggctc agtgagactc ctgctgggag 
accctcatcc agtcctgcgg ggctggtgtg 
aaatgaatcg tgt 913 



gctcccagtc ccgcgctcgg cacgacatga 60 

agttggagaa gcgcagcgac agcctcttcc 12 0 

cctccgaccg cctgagcctg ttccccgcca 180 

tccactccat cctcaaggtg gactgcgtgg 2 40 

tcgtcaccac cgaccacaag gagatcgact 3 00 

cggccatcgc gctggcgctc atcgatttcc 3 60 

gccgccagga acgcaccgca cccgccgcac 42 0 

ccgcaccctc cgagccctcg gagccctcca 480 

catgagcccg ccgcgggcca tacgctggac 540 

gctctccagc cctgcagcag aagaacttcc 60 0 

ggcgccttaa gttattggac tatctaatat 66 0 

tcacatattt tatagtctta atatcttgtt 72 0 

cacttggcca gtttgctttt ctaccatccg 7 80 

ggtggaggcc caggaatggg cgggcaggac 840 

aaaggcgctg ggaaccggct ttgaatgaat 9 00 



<210> 118 

<211> 60 

<212> DNA 

<213> Homo sapiens 



<300> 

<308> NM_003311 



<400> 118 

atttcgctgg ttctttgtag tcacatattt tatagtctta atatcttgtt tttgcatcac 

<210> 119 

<211> 1723 

<212> DNA 

<213> Homo sapiens 



60 



<300> 

<308> NM_003376 
<400> 119 

tcgcggaggc ttggggcagc cgggtagctc ggaggtcgtg gcgctggggg ctagcaccag 60 

cgctctgtcg ggaggcgcag cggttaggtg gaccggtcag cggactcacc ggccagggcg 12 0 

ctcggtgctg gaatttgata ttcattgatc cgggttttat ccctcttctt ttttcttaaa 180 

catttttttt taaaactgta ttgtttctcg ttttaattta tttttgcttg ccattcccca 240 

cttgaatcgg gccgacggct tggggagatt gctctacttc cccaaatcac tgtggatttt 300 

ggaaaccagc agaaagagga aagaggtagc aagagctcca gagagaagtc gaggaagaga 3 60 

gagacggggt cagagagagc gcgcgggcgt gcgagcagcg aaagcgacag gggcaaagtg 420 

agtgacctgc ttttgggggt gaccgccgga gcgcggcgtg agccctcccc cttgggatcc 480 

cgcagctgac cagtcgcgct gacggacaga cagacagaca ccgcccccag ccccagctac 5 40 

cacctcctcc ccggccggcg gcggacagtg gacgcggcgg cgagccgcgg gcaggggccg 6 00 

gagcccgcgc ccggaggcgg ggtggagggg gtcggggctc gcggcgtcgc actgaaactt 660 

ttcgtccaac ttctgggctg ttctcgcttc ggaggagccg tggtccgcgc gggggaagcc 720 

gagccgagcg gagccgcgag aagtgctagc tcgggccggg aggagccgca gccggaggag 780 

ggggaggagg aagaagagaa ggaagaggag agggggccgc agtggcgact cggcgctcgg 840 

aagccgggct catggacggg tgaggcggcg gtgtgcgcag acagtgctcc agccgcgcgc 900 

gctccccagg ccctggcccg ggcctcgggc cggggaggaa gagtagctcg ccgaggcgcc 960 
gaggagagcg ggccgcccca cagcccgagc cggagaggga gcgcgagccg cgccggcccc 
ggtcgggcct ccgaaaccat gaactttctg ctgtcttggg tgcattggag ccttgccttg 

ctgctctacc tccaccatgc caagtggtcc caggctgcac ccatggcaga aggaggaggg 1140 

cagaatcatc acgaagtggt gaagttcatg gatgtctatc agcgcagcta ctgccatcca 1200 

atcgagaccc tggtggacat cttccaggag taccctgatg agatcgagta catcttcaag 12 60 



1020 
1080 



55 



WO 2005/086891 



PCT/US2005/007894 



ccatcctgtg 
gtgcccactg 
cagcacatag 
gatagagcaa 
cgcaagaaat 
aagcatttgt 
cgttgcaagg 
cggtgagccg 



tgcccctgat 
aggagtccaa 
gagagatgag 
gacaagaaaa 
cccggtataa 
ttgtacaaga 
cgaggcagct 
ggcaggagga 



gcgatgcggg 
catcaccatg 
cttcctacag 
aaaatcagtt 
gtcctggagc 
tccgcagacg 
tgagttaaac 
aggagcctcc 



ggctgctgca 
cagattatgc 
cacaacaaat 
cgaggaaagg 
gttccctgtg 
tgtaaatgtt 
gaacgtactt 
ctcagggttt 



atgacgaggg 
ggatcaaacc 
gtgaatgcag 
gaaaggggca 
ggccttgctc 
cctgcaaaaa 
gcagatgtga 
egg 1723 



cctggagtgt 
tcaccaaggc 
accaaagaaa 
aaaacgaaag 
agageggaga 
cacagactcg 
caagecgagg 



1320 
1380 
1440 
1500 
1560 
1620 
1680 



<210> 120 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_003376 
<400> 120 

ccagcacata ggagagatga gcttcctaca gcacaacaaa tgtgaatgca gaccaaagaa 60 

<210> 121 

<211> 2834 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_003406 



<400> 121 

gcccactccc 

aggagataaa 

gecgagcagg 

ggagctgaat 

ggagcccgta 

gagaaaaaac 

atetgeaatg 

gagagcaaag 

gccgctggfcg 

tttgaaatca 

aacttctctg 

aagacagctt 

gacagcacgc 

caaggagacg 

tctgcctcat 

gttttttgtt 

catgtggttt 

tgttttcatc 

ttggcatagt 

gtctaagcaa 

ttggttccag 

ctgtggtaga 

tcaatgtgta 

attttactct 

gttagctttc 

taagaacaaa 

atgtatatca 

ttgcttttgt 

aatagttaac 

tttcatgaac 

ataaaagttt 



accgccagct 
aagaacatcc 
ctgagegata 
tatccaatga 
ggtcatcttg 
agcagatggc 
afcgtactgtc 
tcttctattt 
atgacaagaa 
gcaaaaagga 
tgttctatta 
ttgatgaagc 
taataatgea 
aagctgaagc 
tctaaaattt 
tacgatttat 
ttatgtttaa 
ttgaggtggc 
acttfctggta 
agaaaactgc 
tcacaggtgt 
atcatacccc 
cacctttgac 
ggataagggc 
attatttttg 
tgtaaaagta 
agcacagcag 
gatttttttt 
agggaaataa 
aatccaagca 
tatgaatgga 



ggaaccctgg 
agtcatggat 
tgatgacatg 
ggagaggaat 
gagggtegtc 
tcgagaatac 
tcttttggaa 
gaaaatgaaa 
agggattgtc 
aatgeaacca 
tgagattctg 
cattgetgaa 
attactgaga 
aggagaagga 
acacagtaga 
gacaggttta 
tattagggga 
caatatgggg 
cattgtggct 
ctacatactg 
agtaattgtg 
atggatacca 
tacagctgea 
agaaaeggtt 
ctacactcat 
aagatgeagg 
taaaacaaaa 
tttttttttt 
cttgagatga 
taattgttaa 
cttttcaact 



ggactacgac 
aaaaatgagc 
gcagcctgca 
cttctctcag 
tcaagtattg 
agagagaaaa 
aagttcttga 
ggagattact 
gatcagtcac 
acacatccta 
aactccccag 
cttgatacat 
gacaacttga 
ggggaaaatt 
ccatttgtca 
tgttacttct 
gtagagccag 
atgtggaatt 
tcaaaagggc 
gtttgtcctg 
ggtactttaa 
catattaaac 
gaagtgttcc 
cacattccat 
tttatttgta 
aaaaatgaat 
acccatgtat 
gatacttgee 
tggctagctt 
gaacacgtgt 
actttctcta 



gtccctcaaa 
tggttcagaa 
tgaagtctgt 
ttgcttataa 
aacaaaagac 
ttgagacgga 
tccccaatgc 
acegttaett 
aacaagcata 
tcagactggg 
agaaagectg 
taagtgaaga 
cattgtggac 
aaccggcctt 
tecatgetgt 
atttgaattt 
ttaacattta 
tttatacaag 
cagtgtaaaa 
geggggaata 
ggtttggagc 
catgtatatc 
tttagacaaa 
tatttgtaaa 
tttaaatgtt 
tgcttggtat 
ttaacttttt 
taacatgeat 
tgtttaatgt 
attaaattca 
cagcttttca 



ccttgcttct 
ggccaaactg 
aactgagcaa 
aaatgttgta 
ggaaggtgct 
gctaagagat 
ttcacaagca 
ggctgaggtt 
ccaagaagct 
tctggccctt 
ctctcttgca 
gtcatacaaa 
ateggatace 
ccaacttttg 
cccacaaata 
ctatatttcc 
gggagttatc 
ttataagtgt 
ctgcttccat 
aaagggatca 
acttacaagg 
tgtggaatac 
gttgtgaccc 
gttacctget 
ttaggcaacc 
tcattacttc 
tttaggattt 
gtgctgtaaa 
cttatgaaat 
tgtaagtgga 
tgtaaattag 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 
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tcttggttct gaaacttctc taaaggaaat tgtacatttt ttgaaattta ttccttattc 1920 

cctcttggca gctaatgggc tcttaccaag tttaaacaca aaatttatca taacaaaaat 1980 

actactaata taactactgt ttccatgtcc catgatcccc tctcttcctc cccaccctga 2040 

aaaaaatgag ttcctatttt ttctgggaga gggggggatt gattagaaaa aaatgtagtg 2100 

tgttccattt aaaattttgg catatggcat tttctaactt aggaagccac aatgttcttg 2160 

gcccatcatg acattgggta gcattaactg taagttttgt gcttccaaat cactttttgg 222 0 

tttttaagaa tttcttgata ctcttatagc ctgccttcaa ttttgatcct ttattctttc 22 80 

tatttgtcag gtgcacaaga ttaccttcct gttttagcct tctgtcttgt caccaaccat 2340 

tcttacttgg tggccatgta cttggaaaaa ggccgcatga tctttctggc tccactcagt 2400 

gtctaaggca ccctgcttcc tttgcttgca tcccacagac tatttccctc atcctattta 2460 

ctgcagcaaa tctctcctta gttgatgaga ctgtgtttat ctccctttaa aaccctacct 2520 

atcctgaatg gtctgtcatt gtctgccttt aaaatccttc ctctttcttc ctcctctatt 2580 

ctctaaataa tgatggggct aagttatacc caaagctcac tttacaaaat ^tttcctcag 2 640 

tactttgcag aaaacaccaa acaaaaatgc cattttaaaa aaggtgtatt ttttctttta 2700 

gaatgtaagc tcctcaagag cagggacaat gttttctgta tgttctattg tgcctagtac 2760 

actgtaaatg ctcaataaat attgatgatg ggaggcagtg agtcttgatg ataagggtga 282 0 
gaaactgaaa tccc 2 834 

<210> 122 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_003406 



<400> 122 

tttagccttc tgtcttgtca ccaaccattc ttacttggtg gccatgtact tggaaaaagg 



60 



<210> 123 

<211> 1938 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_003504 



<400> 123 cn 

gatttggcgg gagtcttgac cgccgccggg ctcttggtac ctcagcgcga gcgccaggcg 60 

tccggccgcc gtggctatgt tcgtgtccga tttccgcaaa gagttctacg aggtggtcca 120 

gagccagagg gtccttctct tcgtggcctc ggacgtggat gctctgtgtg cgtgcaagat 180 

ccttcaggcc ttgttccagt gtgaccacgt gcaatatacg ctggttccag tttctgggtg 240 

gcaagaactt gaaactgcat ttcttgagca taaagaacag tttcattatt ttattctcat 300 

aaactgtgga gctaatgtag acctattgga tattcttcaa cctgatgaag acactatatt 360 

ctttgtgtgt gacacccata ggccagtcaa tgtcgtcaat gtatacaacg atacccagat 420 

caaattactc attaaacaag atgatgacct tgaagttccc gcctatgaag acatcttcag 480 

ggatgaagag gaggatgaag agcattcagg aaatgacagt gatgggtcag agccttctga 540 

gaagcgcaca cggttagaag aggagatagt ggagcaaacc atgcggagga ggcagcggcg 600 

agagtgggag gcccggagaa gagacatcct ctttgactac gagcagtatg aatatcatgg 66 0 

gacatcgtca gccatggtga tgtttgagct ggcttggatg ctgtccaagg acctgaatga 720 

catgctgtgg tgggccatcg ttggactaac agaccagtgg gtgcaagaca agatcactca 7 80 

aatgaaatac gtgactgatg ttggtgtcct gcagcgccac gtttcccgcc acaaccaccg 840 

gaacgaggat gaggagaaca cactctccgt ggactgcaca cggatctcct ttgagtatga 900 

cctccgcctg gtgctctacc agcactggtc cctccatgac agcctgtgca acaccagcta 960 

taccgcagcc aggttcaagc tgtggtctgt gcatggacag aagcggctcc aggagttcct 1020 

tgcagacatg ggtcttcccc tgaagcaggt gaagcagaag ttccaggcca tggacatctc 1080 

cttgaaggag aatttgcggg aaatgattga agagtctgca aataaatttg ggatgaagga 1140 

catgcgcgtg cagactttca gcattcattt tgggttcaag cacaagtttc tggccagcga 1200 

cgtggtcttt gccaccatgt ctttgatgga gagccccgag aaggatggct cagggacaga 1260 
tcacttcatc caggctctgg acagcctctc caggagtaac ctggacaagc tgtaccatgg 1320 
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cctggaactc 
caccaacctc 
agatgtcatg 
gtcctttgtg 
ccccctgagc 
ctcggacagg 
ccggatgctg 
caagtttctg 
cttcttattt 
tagaagccat 
aaaaaaaaaa 



gccaagaagc 
gtcatctccc 
ctgttctcta 
tgttcgacaa 
atggagcatg 
aagaactttt 
cacaaccatt 
gacgcactta 
atgtaactgg 
tttttattaa 
aaaaaaaa 



agctgcgagc 
aggggccttt 
ggccggcatc 
agaaccggcg 
gcacagtgac 
ttgggagggc 
ttgacctctc 
tttccctcct 
ctttcattta 
ataaaatgct 
1938 



cacccagcag 
cctgtactgc 
cctaagcctg 
ctgcaaactg 
cgtggtgggc 
gtttgagaag 
agtaattgag 
gtcctaggaa 
gattgtaagt 
tattttaggc 



accattgcca 
tctctcatgg 
ctcagcaaac 
ctgcccctgg 
atccccccag 
gcagcggaaa 
ctgaaagctg 
tttgattctt 
tatggacatg 
tccgtcccca 



gctgcctttg 
agggcactcc 
acctgctcaa 
tgatggctgc 
agaccgacag 
gcaccagctc 
aggatcggag 
ccagaatgac 
atttgagatg 
aaaaaaaaaa 



1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 



<210> 124 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM.003504 
<400> 124 

caagtttctg gacgcactta tttccctcct gtcctaggaa tttgattctt ccagaatgac 

<210> 125 

<211> 2346 

<212> DNA 

<213> Homo sapiens 



60 



<300> 

<308> NM_003600 



<400> 125 

acaaggcagc 

atttaaaaag 

ggtcgcaggt 

gacggcgggg 

ctccgtccct 

ctgggcaccg 

ggaccgatct 

tccaaaacgt 

tggccaggct 

acaaaagctt 

aaccagtgta 

cctgccatcg 

agaatcaaaa 

aggaaagttt 

taaagtgtta 

agtagaaata 

tgatgctacc 

acttcagaaa 

aaatgccctg 

cttacttctt 

tccatcttcc 

tgaaggtcgg 

atttttagtt 

atcacgggtt 

aagactgttg 

ctggatcaca 

acagtcttag 

caggaacatg 

cacaagaaat 



ctcgctcgag 

aagaaccttt 

gggagccgac 

acaaggggcg 

gagtgtcctt 

aggcgccctg 

aaagaaaact 

gttctcgtga 

cagcgggtct 

gtctccagtc 

cctcatcctg 

gcacctgaaa 

aagaggcagt 

ggtaatgttt 

tttaaagctc 

cagtcccacc 

agagtctacc 

ctttcaaagt 

tcttactgtc 

ggatcagctg 

aggaggacca 

atgcatgatg 

gggaagcctc 

gaattcacat 

aagcataatc 

gcaaattcat 

gaatcgtgca 

ctactgaagt 

atttgtttta 



cgcaggccaa 

gaattctaac 

gggtgggtag 

gctggtcgga 

ggcgctgcct 

taggatactg 

gcatttcagg 

ctcagcaatt 

tgtgtccttc 

acaagccggt 

tctccaggcc 

ataatcctga 

gggctttgga 

atttggcaag 

agctggagaa 

ttcggcatcc 

taattctgga 

ttgatgagca 

attcgaagag 

gagagcttaa 

ctctctgtgg 

agaaggtgga 

cttttgaggc 

tccctgactt 

ccagccagag 

caaaaccatc 

gggggagaaa 

ttattttacc 

ctcagcaggt 



tcggctttct 

ggctgagctc 

accgtggggg 

gtggcggagc 

tgtgcccgcc 

cttgttactt 

acctgttaag 

tccttgtcag 

aaattcttcc 

tcagaatcag 

actgaataac 

ggaggaactg 

agactttgaa 

agaaaagcaa 

agccggagtg 

taatattctt 

atatgcacca 

gagaactgct 

agttattcat 

aattgcagat 

caccctggac 

tctctggagc 

aaacacatac 

tgtaacagag 

gccaatgctc 

aaattgccaa 

tccttgagcc 

attgactgct 

gtgccttaac 



agctagaggg 

ttggaagact 

atatctcagt 

gtcaagtccc 

cagcgccttt 

attacagcta 

gctacagctc 

aatccattac 

cagcgcattc 

aagcagaagc 

acccaaaaga 

gcatcaaaac 

attggtcgcc 

agcaagttta 

gagcatcagc 

agactgtatg 

cttggaacag 

acttatataa 

agagacatta 

tttgggtggt 

tacctgcccc 

cttggagttc 

caagagacct 

ggagccaggg 

agagaagtac 

aacaaagaat 

agggctgcca 

gccctcaatc 

ctccctattc 



tttaactcct 

tgggtccttg 

ggcggacgag 

ctgtcggttc 

gcatccgctc 

gaggcatcat 

cagttggagg 

ctgtaaatag 

ctttgcaagc 

aattgcaggc 

gcaagcagcc 

agaaaaatga 

ctctgggtaa 

ttctggctct 

tcagaagaga 

gttatttcca 

tttatagaga 

cagaattggc 

agccagagaa 

cagtacatgc 

ctgaaatgat 

tttgctatga 

acaaaagaat 

acctcatttc 

ttgaacaccc 

cagctagcaa 

tataacctga 

tagaacgcta 

agaaagctcc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 
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acatcaataa acatgacact ctgaagtgaa agtagccacg agaattgtgc tacttatact 1800 

ggttcataat ctggaggcaa ggttcgactg cagccgcccc gtcagcctgt gctaggcatg 1860 

gtgtcttcac aggaggcaaa tccagagcct ggctgtgggg aaagtgacca ctctgccctg 192 0 

accccgatca gttaaggagc tgtgcaataa ccttcctagt acctgagtga gtgtgtaact 1980 

tattgggttg gcgaagcctg gtaaagctgt tggaatgagt atgtgattct ttttaagtat 2 04 0 

gaaaataaag atatatgtac agacttgtat tttttctctg gtggcattcc tttaggaatg 210 0 

ctgtgtgtct gtccggcacc ccggtaggcc tgattgggtt tctagtcctc cttaaccact 2160 

tatctcccat atgagagtgt gaaaaatagg aacacgtgct ctacctccat ttagggattt 222 0 

gcttgggata cagaagaggc catgtgtctc agagctgtta agggcttatt tttttaaaac 22 80 

attggagtca tagcatgtgt gtaaacttta aatatgcaaa taaataagta tctatgtcta 2340 
aaaaaa 2346 

<210> 126 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_003600 
<400> 126 

agagtgtgaa aaataggaac acgtgctcta cc tccattta gggatttgct tgggatacag 60 

<210> 127 
<211> 853 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> NM_003641 
<400> 127 

ctagtcctga cttcacttct gatgaggaag cctctctcct tagccttcag cctttcctcc 6 0 

caccctgcca taagtaattt gatcctcaag aagttaaacc acacctcatt ggtccctggc 12 0 

taattcacca atttacaaac agcaggaaat agaaacttaa gagaaataca cacttctgag 180 

aaactgaaac gacaggggaa aggaggtctc actgagcacc gtcccagcat ccggacacca 240 

cagcggccct tcgctccacg cagaaaacca cacttctcaa accttcactc aacacttcct 3 00 

tccccaaagc cagaagatgc acaaggagga acatgaggtg gctgtgctgg gggcaccccc 3 60 

cagcaccatc cttccaaggt ccaccgtgat caacatccac agcgagacct ccgtgcccga 42 0 

ccatgtcgtc tggtccctgt tcaacaccct cttcttgaac tggtgctgtc tgggcttcat 480 

agcattcgcc tactccgtga agtctaggga caggaagatg gttggcgacg tgaccggggc 540 

ccaggcctat gcctccaccg ccaagtgcct gaacatctgg gccctgattc tgggcatcct 60 0 

catgaccatt ggattcatcc tgtcactggt attcggctct gtgacagtct accatattat 66 0 

gttacagata atacaggaaa aacggggtta ctagtagccg cccatagcct gcaacctttg 72 0 

cactccactg tgcaatgctg gccctgcacg ctggggctgt tgcccctgcc cccttggtcc 7 80 

tgcccctaga tacagcagtt tatacccaca cacctgtcta cagtgtcatt caataaagtg 840 
cacgtgcttg tga 853 

<210> 128 

<211> 60 

<212> DNA 

<213> Homo sapiens 



<300> 

<308> NM_003641 
<400> 128 

attatgttac agataataca ggaaaaacgg ggttactagt agccgcccat agcctgcaac 60 
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<210> 129 

<211> 1280 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_003756 
<400> 129 

gaaagatggc gtcccgcaag gaaggtaccg gctctactgc cacctcttcc agctccaccg 60 

ccggcgcagc agggaaaggc aaaggcaaag gcggctcggg agattcagcc gtgaagcaag 120 

tgcagataga tggccttgtg gtattaaaga taatcaaaca ttatcaagaa gaaggacaag 180 

gaactgaagt tgttcaagga gtgcttttgg gtctggttgt agaagatcgg cttgaaatta 240 

ccaactgctt tcctttccct cagcacacag aggatgatgc tgactttgat gaagtccaat 300 

atcagatgga aatgatgcgg agccttcgcc atgtaaacat tgatcatctt cacgtgggct 3 60 

ggtatcagtc cacatactat ggctcattcg ttacccgggc actcctggac tctcagttta 420 

gttaccagca tgccattgaa gaatctgtcg ttctcattta tgatcccata aaaactgccc 480 

aaggatctct ctcactaaag gcatacagac tgactcctaa actgatggaa gtttgtaaag 540 

aaaaggattt ttcccctgaa gcattgaaaa aagcaaatat cacctttgag tacatgtttg 600 

aagaagtgcc gattgtaatt aaaaattcac atctgatcaa tgtcctaatg tgggaacttg 660 

aaaagaagtc agctgttgca gataaacatg aattgctcag ccttgccagc agcaatcatt 72 0 

tggggaagaa tctacagttg ctgatggaca gagtggatga aatgagccaa gatatagtta 780 

aatacaacac atacatgagg aatactagta aacaacagca gcagaaacat cagtatcagc 840 

agcgtcgcca gcaggagaat atgcagcgcc agagccgagg agaacccccg ctccctgagg 900 

aggacctgtc caaactcttc aaaccaccac agccgcctgc caggatggac tcgctgctca 960 

ttgcaggcca gataaacact tactgccaga acatcaagga gttcactgcc caaaacttag 102 0 

gcaagctctt catggcccag gctcttcaag aatacaacaa ctaagaaaag gaagtttcca 1080 

gaaaagaagt taacatgaac tcttgaagtc acaccagggc aactcttgga agaaatatat 1140 

ttgcatattg aaaagcacag aggatttctt tagtgtcatt gccgattttg gctataacag 12 00 

tgtctttcta gccataataa aataaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 12 6 0 
aaaaaaaaaa aaaaaaaaaa 1280 

<210> 130 

<211> 60 

<212> DNA 

<213> Homo sapiens 



<300> 

<308> NM_003756 
<400> 130 

tgagccaaga tatagttaaa tacaacacat acatgaggaa tactagtaaa caacagcagc 60 

<210> 131 
<211> 839 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> NM_003832 
<400> 131 

aagccacagg ctccctggct ggcgtcagct aaagtggctg ttgggtgtcc gcaggcttct 60 

gcctggccgc cgccgcctat aagctaccag gaggagcttt acgacttccc gtcctgcggg 120 

aagtggcggg cacgatcgca aggtagcgca gaagcttctc aatggccagc gccagctgca 180 

gccccggcgg cgcactcgcc tcacctgagc ctgggaggaa aattcttcca aggatgatct 240 

cccactcaga gctgaggaag cttttctact cagcagatgc tgtgtgtttt gatgttgaca 3 00 

gcacggtcat cagtgaagaa ggaatcggat gctttcattg gatttggagg aaatgtgatc 3 60 

aggcaacaag tcaaggataa cgccaaatgg tatatcactg attttgtaga gctgctggga 420 
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gaaccggaag 
aagttacaca 
tggtacagat 
gtttgttttt 
gtaatctgaa 
gagagtcaat 
tggttttttt 



aataacatcc 
gattgatact 
gatctgcact 
aactgtagtt 
ttctttatgt 
taaacacctt 
ccctttggta 



attgtcatac 
gtttgcttac 
gtcaagtaaa 
ccagtattat 
atattcctag 
tactcttagg 
tgtcaataaa 



agctccaaac 
aattgcctat 
ctacagttag 
atgatcacta 
ctatatttca 
aatatagatt 
agtttatcca 



aacttcagat 
tacaacttgc 
gaatcctcaa 
tcgatttcct 
tacaaagtgt 
cggcagcctt 
tgtgtcagaa 



gaatttttac 480 

tataaaaagt 540 

agattggttt 600 

ggagagtttt 66 0 

tttaagagtg 72 0 

cagtgaatat 7 80 

aaaaaaaaa 839 



<210> 132 

<211> 60 

<212> DNA 

<213> Homo sapiens 



<300> 

<308> NM_003 832 
<400> 132 

gaagaaggaa tcggatgctt tcattggatt tggaggaaat gtgatcaggc aacaagtcaa 60 

<210> 133 

<211> 3128 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_003981 
<400> 133 

gcttcgcccc gtggcgcggt ttgaaatttt gcggggctca acggctcgcg gagcggctac 6 0 

gcggagtgac atcgccggtg tttgcgggtg gttgttgctc tcggggccgt gtggagtagg 12 0 

tctggacctg gactcacggc tgcttggagc gtccgccatg aggagaagtg aggtgctggc 180 

ggaggagtcc atagtatgtc tgcagaaagc cctaaatcac cttcgggaaa tatgggagct 2 40 

aattgggatt ccagaggacc agcggttaca aagaactgag gtggtaaaga agcatatcaa 3 00 

ggaactcctg gatatgatga ttgctgaaga ggaaagcctg aaggaaagac tcatcaaaag 3 60 

catatccgtc tgtcagaaag agctgaacac tctgtgcagc gagttacatg ttgagccatt 420 

tcaggaagaa ggagagacga ccatcttgca actagaaaaa gatttgcgca cccaagtgga 480 

attgatgcga aaacagaaaa aggagagaaa acaggaactg aagctacttc aagagcaaga 540 

tcaagaactg tgcgaaattc tttgtatgcc ccactatgat attgacagtg cctcagtgcc 600 

cagcttagaa gagctgaacc agttcaggca acatgtgaca actttgaggg aaacaaaggc 660 

ttctaggcgt gaggagtttg tcagtataaa gagacagatc atactgtgta tggaagaatt 720 

agaccacacc ccagacacaa gctttgaaag agatgtggtg tgtgaagacg aagatgcctt 7 80 

ttgtttgtct ttggagaata ttgcaacact acaaaagttg ctacggcagc tggaaatgca 840 

gaaatcacaa aatgaagcag tgtgtgaggg gctgcgtact caaatccgag agctctggga 900 

caggttgcaa atacctgaag aagaaagaga agctgtggcc accattatgt ctgggtcaaa 960 

ggccaaggtc cggaaagcgc tgcaattaga agtggatcgg ttggaagaac tgaaaatgca 102 0 

aaacatgaag aaagtgattg aggcaattcg agtggagctg gttcagtact gggaccagtg 1080 

cttttatagc caggagcaga gacaagcttt tgcccctttc tgtgctgagg actacacaga 1140 

aagtctgctc cagctccacg atgctgagat tgtgcggtta aaaaactact atgaagbtca 1200 

caaggaactc tttgaaggtg tccagaagtg ggaagaaacc tggaggcttt tcttagagtt 1260 

tgagagaaaa gcttcagatc caaatcgatt tacaaaccga ggaggaaatc ttctaaaaga 1320 

agaaaaacaa cgagccaagc tccagaaaat gctgcccaag ctggaagaag agttgaaggc 13 80 

acgaattgaa ttgtgggaac aggaacattc aaaggcattt atggtgaatg ggcagaaatt 1440 

catggagtat gtggcagaac aatgggagat gcatcgattg gagaaagaga gagccaagca 1500 

ggaaagacaa ctgaagaaca aaaaacagac agagacagag atgctgtatg gcagcgctcc 1560 

tcgaacacct agcaagcggc gaggactggc tcccaataca ccgggcaaag cacgtaagct 162 0 

gaacactacc accatgtcca atgctacggc caatagtagc attcggccta tctttggagg 1680 

gacagtctac cactcccccg tgtctcgact tcctccttct ggcagcaagc cagtcgctgc 1740 

ttccacctgt tcagggaaga aaacaccccg tactggcagg catggagcca acaaggagaa 1800 

cctggagctc aacggcagca tcctgagtgg tgggtaccct ggctcggccc ccctccagcg 1860 

caacttcagc attaattctg ttgccagcac ctattctgag tttgcgaagg atccgtccct 192 0 
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ctctgacagt tccactgttg ggcttcagcg agaactttca aaggcttcca aatctgatgc 19 80 

tacttctgga atcctcaatt caaccaacat ccagtcctga gaagccctga tcagtcaacc 2 040 

agctgtggct tcctgtgcct agactggacc taattatatg ggggtgactt tagtttttct 2100 

tcagcttagg cgtgcttgaa accttggcca ggttccatga ccatgggcct aacttaaaga 2160 

tgtgaatgag tgttacagtt gaaagcccat cataggttta gtggtcctag gagacttggt 2 22 0 

tttgacttat atacatgaaa agtttatggc aagaagtgca aattttagca tatggggcct 22 80 

gacttctcta ccacataatt ctacttgctg aagcatgatc aaagcttgtt ttatttcacc 2340 

actgtaggaa aatgattgac tatgcccatc cctgggggta attttggcat gtatacctgt 2 40 0 

aactagtaat taacatcttt tttgtttagg catgttcaat taatgctgta gctatcatag 2460 

ctttgctctt acctgaagcc ttgtccccac cacacaggac agccttcctc ctgaagagaa 2 52 0 

tgtctttgtg tgtccgaagt tgagatggcc tgccctactg ccaaagaggt gacaggaagg 2 580 

ctgggagcag ctttgttaaa ttgtgttcag ttctgttaca cagtgcattg ccctttgttg 2 640 

ggggtatgca tgtatgaaca cacatgcttg tcggaacgct ttctcggcgt ttgtcccttg 27 0 0 

gctctcatct cccccattcc tgtgcctact ttgcctgagt tcttctaccc ccgcagttgc 2760 

cagccacatt gggagtctgt ttgttccaat gggttgagct gtctttgtcg tggagatctg 282 0 

gaactttgca catgtcacta ctggggaggt gttcctgctc tagcttccac gatgaggcgc 2880 

cctctttacc tatcctctca atcactactc ttcttgaagc actattattt attcttccgc 2940 

tgtctgcctg cagcagtact actgtcaaca tagtgtaaat ggttctcaaa agcttaccag 3000 

tgtggacttg gtgttagcca cgctgtttac tcatacagta cgtgtcctgt ttttaaaata 3 06 0 

tacaattatt cttaaaaata aattaaaatc tgtatactta catttcaaaa agaaaaaaaa 312 0 
aaaaaaaa 312 8 



<210> 134 

<211> 60 

<212> DNA 

<213> Homo sapiens 



<300> 

<308> NM_0039 81 



<400> 134 

tgcagcagta ctactgtcaa catagtgtaa atggttctca aaagcttacc agtgtggact 60 

<210> 135 

<211> 1816 

<212> DNA 

<213> Homo sapiens 



<300> 

<308> NM_004029 



<400> 135 

ggcacccagg gtccggcctg cgccttcccg ccaggcctgg acactggttc aacacctgtg 60 

acttcatgtg tgcgcgccgg ccacacctgc agtcacacct gtagccccct ctgccaagag 12 0 

atccataccg aggcagcgtc ggtggctaca agccctcagt ccacacctgt ggacacctgt 180 

gacacctggc cacacgacct gtggccgcgg cctggcgtct gctgcgacag gagcccttac 240 

ctcccctgtt ataacacctg accgccacct aactgcccct gcagaaggag caatggcctt 3 00 

ggctcctgag agggcagccc cacgcgtgct gttcggagag tggctccttg gagagatcag 3 60 

cagcggctgc tatgaggggc tgcagtggct ggacgaggcc cgcacctgtt tccgcgtgcc 42 0 

ctggaagcac ttcgcgcgca aggacctgag cgaggccgac gcgcgcatct tcaaggcctg 480 

ggctgtggcc cgcggcaggt ggccgcctag cagcagggga ggtggcccgc cccccgaggc 540 

tgagactgcg gagcgcgccg gctggaaaac caacttccgc tgcgcactgc gcagcacgcg 600 

tcgcttcgtg atgctgcggg ataactcggg ggacccggcc gacccgcaca aggtgtacgc 660 

gctcagccgg gagctgtgct ggcgagaagg cccaggcacg gaccagactg aggcagaggc 720 

ccccgcagct gtcccaccac cacagggtgg gcccccaggg ccattcttgg cacacacaca 7 80 

tgctggactc caagccccag gccccctccc tgccccagct ggtgacaagg gggacctcct 840 

gctccaggca gtgcaacaga gctgcctggc agaccatctg ctgacagcgt catggggggc 900 

agatccagtc ccaaccaagg ctcctggaga gggacaagaa gggcttcccc tgactggggc 960 

ctgtgctgga ggcgaggccg cggccccaga gtccccgcac caggcagagc cgtacctgtc 102 0 

accctcccca agcgcctgca ccgcggtgca agagcccagc ccaggggcgc tggacgtgac 1080 
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catcatgtac 

cctatacggc 

cagccctgcc 

cgtggcccct 

caagtgcaag 

agcctgcctg 

agagctggtg 

gggcttcggg 

gctggaaccc 

cctggatagc 

cgagtgcttc 

gaaagctgga 

cagaacaaaa 



aagggccgca cggtgctgca 
cccccagacc cagctgtccg 
gagctcccgg accagaagca 
gggttgcacc tggagcttcg 
gtgtactggg aggtgggcgg 
ctgcctcgga actgtgacac 
gaattccggg cacggcagcg 
caggacctgt cagctgggag 
tggctgtgcc gagtgcacct 
agcagcctca gcctctgcct 
cttatggagc tggagcagcc 
gcagcccacc tagagctggc 
aaaaaa 1816 



gaaggtggtg 
ggccacagac 
gctgcgctac 
ggggccacag 
acccccaggc 
ccccatcttc 
ccgtggctcc 
gcccaaggag 
agagggcacg 
gtccagcgcc 
cgcctagaac 
cgcggccgcc 



ggacacccga 
ccccagcagg 
acggaggaac 
ctgtgggccc 
tccgccagcc 
gacttcagag 
ccacgctata 
aagagcctgg 
cagcgtgagg 
aacagcctct 
ccagtctaat 
cagtctaata 



gctgcacgtt 

tagcattccc 

tgctgcggca 

ggcgcatggg 

cctccacccc 

tcttcttcca 

ccatctacct 

tcctggtgaa 

gtgtgtcttc 

atgacgacat 

gagaactcca 

aaaagaactc 



<210> 136 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_004029 
<400> 136 



1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 



agcagcccac ctagagctgg ccgcggccgc ccagtctaat aaaaagaact ccagaacaaa 



<210> 137 
<211> 2121 
<212> DNA 

<213> Homo sapiens 
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<300> 

<308> NM_004203 



<400> 137 

tggaattttt 

cctccgcccg 

ggtcccgtgg 

cgccccaggg 

cttaccgtct 

gagacggccc 

ctcagggaca 

gaggctgttg 

ccacggaggg 

gccacgcaga 

ctccgccccc 

gctggcacca 

agagccctgg 

tcagccgcct 

gccggctcta 

gcaagttggc 

tggagcaggc 



ggcgcgagca 

ggcgtccgga 

cctccgcgga 

cttccccaca 

accgggaaag 

gatcgcccag 

ccctggattc 

ggccaggcca 

caccccgcca 

acctggattc 

tgccaagggc 

gctgcagccc 

gtatgaccca 

gggccatggc 

tgcggtaaag 

cgaggtgggc 

ctgggaggag 



gctccgcgcg 

acagtcgacg 

cccggaatct 

cccacggagt 

gtggccagca 

ggccccggaa 

acgtgggagc 

gtcatgctag 

cctctgagtg 

tccctcaaga 

agcattccca 

cggcgggtgt 

agccggccag 

tcctacggag 

cgttccatgt 

agccacgaga 

ggcggcatcc 



cgttcacggg 

gcagactccg 

gggccctcgc 

gaagtcagcc 

gatgtgtcgg 

gctgcggagg 

ccctgctcct 

aacggcctcc 

gcacccccat 

ggcccagggg 

tcagccgcct 

cattccgggg 

agtccttctt 

aggtcttcaa 

caccattccg 

aggtggggca 

tgtacctgca 



ccgttccccc 

gcccgctgag 

ggacccgcgc 

gcggccctgc 

gcctggtgag 

tcacccccgc 

gcctcccccg 

tgcactggcc 

cccagtccca 

gctcagccgg 

cttccctcct 

cgaggcctca 

ccagcagagc 

ggtgcgctcc 

gggccccaag 

gcacccatgc 

gacggagctg 



tcacgggagt 

ccacccgagg 

cccgcccagt 

ctgggaggaa 

agggtgaggc 

ctggccttag 

tcccaccact 

atgcccatgc 

gcctacttcc 

agcctcccac 

cggaccccag 

gagactctgc 

ttccagaggc 

aaggaggacg 

gaccgggccc 

tgcgtgcggc 

tgcgggccca 



gcctgcagca 
acctgcggga 
atgtcaagcc 
gactgctggt 
acatggcccc 
gcctcaccat 
agcagctgcg 
gttctgtcct 
tgctggcact 



acactgtgag 
cacgctgctt 
tgccaacatc 
ggagctgggt 
cgagctgctg 
cctggaagtg 
ccagggctac 
tgtcatgatg 
gcctgtgttg 



gcctggggtg 
gccctggccc 
ttcctggggc 
acagcaggag 
cagggctcct 
gcatgcaaca 
ctgccccctg 
ctggagccag 
aggcagccgc 



ccagcctgcc 
atctgcacag 
cccggggccg 
ctggtgaggt 
atgggacagc 
tggagctgcc 
agttcactgc 
accccaagct 
gggcctgggg 



tgaggcccag 
ccagggcctg 
ctgcaagctg 
ccaggaggga 
agcggatgtg 
ccacggtggg 
cggtctgtct 
gcgggccacg 
tgtgctgtgg 



gtctggggct 
gtgcaccttg 
ggtgacttcg 
gacccccgct 
ttcagtctgg 
gagggctggc 
tccgagctgc 
gccgaggccc 
tgcatggcag 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
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cggaggccct gagccgaggg tgggccctgt ggcaggccct gcttgccctg ctctgctggc 162 0 

tctggcatgg gctggctcac cctgccagct ggctacagcc cctgggcccg ccagccaccc 1680 

cgcctggctc accaccctgc agtttgctcc tggacagcag cctctccagc aactgggatg 1740 

acgacagcct agggccttca ctctcccctg aggctgtcct ggcccggact gtggggagca 1800 

cctccacccc ccggagcagg tgcacaccca gggatgccct ggacctaagt gacatcaact 1860 

cagagcctcc tcggggctcc ttcccctcct ttgagcctcg gaacctcctc agcctgtttg 1920 

aggacaccct agacccaacc tgagccccag actctgcctc tgcactttta accttttatc 19 80 

ctgtgtctct cccgtcgccc ttgaaagctg gggcccctcg ggaactccca tggtcttctc 2 040 

tgcctggccg tgtctaataa aaagtatttg aaccttggga gcacccaagc ttgctcatgt 2100 
ggcaaaaaaa aaaaaaaaaa a 2121 

<210> 138 
<211> 60 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> NM_004203 
<400> 138 

ctggccgtgt ctaataaaaa gtatttgaac cttgggagca cccaagcttg ctcatgtggc 60 

<210> 139 

<211> 1982 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_004207 
<400> 139 

ggcgagaggc gggctgaggc ggcccagcgg cggcaggtga ggcggaacca accctcctgg 60 

ccatgggagg ggccgtggtg gacgagggcc ccacaggcgt caaggcccct gacggcggct 12 0 

ggggctgggc cgtgctcttc ggctgtttcg tcatcactgg cttctcctac gccttcccca 180 

aggccgtcag tgtcttcttc aaggagctca tacaggagtt tgggatcggc tacagcgaca 240 

cagcctggat ctcctccatc ctgctggcca tgctctacgg gacaggtccg ctctgcagtg 3 00 

tgtgcgtgaa ccgctttggc tgccggcccg tcatgcttgt ggggggtctc tttgcgtcgc 360 

tgggcatggt ggctgcgtcc ttttgccgga gcatcatcca ggtctacctc accactgggg 42 0 

tcatcacggg gttgggtttg gcactcaact tccagccctc gctcatcatg ctgaaccgct 480 

acttcagcaa gcggcgcccc atggccaacg ggctggcggc agcaggtagc cctgtcttcc 540 

tgtgtgccct gagcccgctg gggcagctgc tgcaggaccg ctacggctgg cggggcggct 600 

tcctcatcct gggcggcctg ctgctcaact gctgcgtgtg tgccgcactc atgaggcccc 66 0 

tggtggtcac ggcccagccg ggctcggggc cgccgcgacc ctcccggcgc ctgctagacc 72 0 

tgagcgtctt ccgggaccgc ggctttgtgc tttacgccgt ggccgcctcg gtcatggtgc 7 80 

tggggctctt cgtcccgccc gtgttcgtgg tgagctacgc caaggacctg ggcgtgcccg 840 

acaccaaggc cgccttcctg ctcaccatcc tgggcttcat tgacatcttc gcgcggccgg 900 

ccgcgggctt cgtggcgggg cttgggaagg tgcggcccta ctccgtctac ctcttcagct 960 

tctccatgtt cttcaacggc ctcgcggacc tggcgggctc tacggcgggc gactacggcg 102 0 

gcctcgtggt cttctgcatc ttctttggca tctcctacgg catggtgggg gccctgcagt 1080 

tcgaggtgct catggccatc gtgggcaccc acaagttctc cagtgccatt ggcctggtgc 1140 

fcgctgatgga ggcggtggcc gtgctcgtcg ggcccccttc gggaggcaaa ctcctggatg 1200 

cgacccacgt ctacatgtac gtgttcatcc tggcgggggc cgaggtgctc acctcctccc 12 60 

tgattttgct gctgggcaac ttcttctgca ttaggaagaa gcccaaagag ccacagcctg 132 0 

aggtggcggc cgcggaggag gagaagctcc acaagcctcc tgcagactcg ggggtggact 1380 

tgcgggaggt ggagcatttc ctgaaggctg agcctgagaa aaacggggag gtggttcaca 1440 

ccccggaaac aagtgtctga gtggctgggc ggggccggca ggcacaggga ggaggtacag 150 0 

aagccggcaa cgcttgctat ttattttaca aactggactg gctcaggcag ggccacggct 156 0 

gggctccagc tgccggccca gcggatcgtc gcccgatcag tgttttgagg gggaaggtgg 162 0 

cggggtggga accgtgtcat tccagagtgg atctgcggtg aagccaagcc gcaaggttac 1680 

aaggcatcct caccaggggc cccgcctgct gctcccaggt ggcctgcggc cactgctatg 1740 

ctcaaggacc tggaaaccca tgcttcgaga caacgtgact ttaatgggag ggtgggtggg 1800 
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ccgcagacag gctggcaggg caggtgctgc gtggggccct ctccagcccg tcctaccctg 1860 

ggctcacatg gggcctgtgc ccacccctct tgagtgtctt ggggacagct ctttccaccc 192 0 

ctggaagatg gaaataaacc tgcgtgtggg tggagtgttc tcgtgccgaa ttcaaaaagc 19 80 
tt 1982 

<210> 140 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_004207 
<400> 140 

cctcttgagt gtcttgggga cagctctttc cacccctgga agatggaaat aaacctgcgt 6 0 

<210> 141 

<211> 2054 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_004209 
<400> 141 

cgggaggcgg cagcggctgc agcgttggta gcatcagcat cagcatcagc ggcagcggca 60 

gcggcctcgg gcggggccgg ccggacggac aggcggacag aaggcgccag gggcgcgcgt 12 0 

cccgcccggg ccggccatgg agggcgcctc cttcggcgcg ggccgcgcag gggccgccct 180 

ggaccccgtg agctttgcgc ggcggcccca gaccctgctc cgggtcgcgt cctgggtgtt 2 40 

ctccatcgcc gtcttcgggc ccatcgtcaa cgagggctac gtgaacaccg acagcggccc 300 

cgagctgcgc tgcgtgttca acgggaacgc gggcgcctgc cgcttcggcg tcgcgctggg 3 60 

cctcggagcc ttcctcgcct gcgccgcctt cctgctgctc gatgtgcgct tccagcaaat 42 0 

cagcagcgtc cgcgaccgcc ggcgcgcggt gttgctggac ctgggcttct caggactctg 4 80 

gtccttcctg tggttcgtgg gcttctgctt cctcaccaat cagtggcagc gcacggcgcc 54 0 

agggccggcc acgacgcagg cgggggacgc ggcgcgggcc gccatcgcct tcagcttctt 6 00 

ctccatcctc agctgggtgg cgctcaccgt gaaggccctg cagcggttcc gcctgggcac 6 60 

cgacatgtca ctcttcgcca ccgaacagct gagcaccggg gcgagccagg cctaccccgg 72 0 

ctatccggtg ggcagcggcg tggagggcac cgagacctac cagagcccgc ccttcaccga 7 80 

gaccctggac accagcccca aagggtacca ggtgcccgcc tactagcggc tggcaggcac 840 

agaccagggc tccaaggcca ccccaccaac gcaggcccca gggtctccgg gacctccctt 9 00 

gggtcctt cc agctcagtgc cgcggacaga gtaggtggcc gctttgcgcc atccggggcc 96 0 

aagagggggt ggacccgcgt gtctgggctg cccctgccaa gttcccccag tccctcagca 102 0 

cctggcccca ggactgaggt cctgagaagg ggatagcact gcccaggacg tgtgtcccta 1080 

gcctggaatg gactggcctg gggaaggctt tcccctcttg ggccacacct gctcactctg 1140 

gggttggggg tccagctgcc ctctacgatc aggtgcaggg gctgcccagg acaaagcggg 12 00 

ggcaggggaa agacaccacc ctcgccccaa gactggggat cctggccact gttcccatcc 12 60 

catgtccctg tgggtagtga ctgtctcgtt tctgtcatgg tggtgcgtcc cgtccggagc 1320 

cactctccac tttctctcac aggctgctag aacagcccag ccctgtcagt gttgtgatca 1380 

tggtccagtc ttcgggtttc acctcctagt actccacaag ctgctcctct ctctgtggcc 144 0 

ccggcccctg cccaggtgtg ggtggttctg gccaggaagg cacaaggtag ctgtgggcca 150 0 

agacaccagc cctgtcctag cccttcagta agaccttgcc aggagaggag aaggatgcct 1560 

gggtgccagg caagacaagc ccctcagcag gagagaggcc cagaggctcc agctggccac 162 0 

cgtgccccac aagatggccc ctgtgtggtt ccctttacct tggcttcctg gcccagtccc 1680 

tgcctctcca cctgcaccct gcttcctggc ccagtcccag gttggagtcc ctctgcatag 1740 

ctgactactc atgcattgct caaagctggc ttttcacatt aagtcaacac caaacgtggt 1800 

tgccacattt catcagacag acacctccct ctggagatgc agttgagtga caaccttgtt 1860 

acattgtagc ctagaccaat tctgtgtgga tatttaagtg aacatgttta caatttttgt 192 0 

atatatcact ctctccctct cctgaaagac cagagattgt gtattttcag tgtcccatgt 1980 
tccgactgca ccttctttac aataaagact gtaactgagc tgactgtgaa aaaaaaaaaa 2 040 
aaaaaaaaaa aaaa 2 054 
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<210> 142 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_0042 09 

gatgcagttg agtgacaacc ttgttacatt gtagcctaga ccaattctgt gtggatattt 6 0 

<210> 143 

<211> 1224 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_004217 



ggccgggaga gtagcagtgc cttggacccc agctctcctc cccctttctc tcta«raatg 
gcccagaagg agaactccta cccctggccc tacggccgac agacggctoc atctggcctg 
agcaccctgc cccagcgagt cctccggaaa gagcctgtca ccccatctgc acttgtcctc 
atgagccgct ccaatgtcca gcccacagct gcccctggcc agaaggtgat ggagaatagc 
agJgggacac ccgacatctt aacgcggcac ttcacaattg atgactttga gattgggcgt 
cctctgggca aaggcaagtt tggaaacgtg tacttggctc gggagaagaa *^catttc 
atcgtggcgc tcaaggtcct cttcaagtcc cagatagaga aggagggcgt ggagcatcag 
ctgcgcagag agatcgaaat ccaggcccac ctgcaccatc ccaacatcct gcgtctctac 
aactatttt? atgaccggag gaggatctac ttgattctag agtatgcccc ccgcggggag 
ctctacaagg agctgcagaa gagctgcaca tttgacgagc agcgaacagc ^acgatcatg 
gaggagttgg cagatgctct aatgtactgc catgggaaga aggtgattca cagagacata 
aagccagaaa atctgctctt agggctcaag ggagagctga agattgctga cttcggctgg 
tctgtgcatg cgccctccct gaggaggaag acaatgtgtg gcaccctgga ctacctgccc 
ccagagatga ttgaggggcg catgcacaat gagaaggtgg atctgtggtg ^attggagtg 
ctttgctatg agctgctggt ggggaaccca ccctttgaga gtgcatcaca caacgaga c 
tatcgccgca tcgtcaaggt ggacctaaag ttccccgctt ctgtgcccac gggagcccag 
gacctcatct ccaaactgct caggcataac ccctcggaac ggctgcccct J^ccaggtc 
tcagcccacc cttgggtccg ggccaactct cggagggtgc tgcctccctc tgcccttcaa 
tctgtcgcct gatggtccct gtcattcact cgggtgcgtg tgtttgtatg tctgtgtatg 
tataggggaa agaagggatc cctaactgtt cccttatctg ttttctacct cctcctttgt 
ttaataaagg ctgaagcttt ttgt 1224 
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<210> 144 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_004217 

g£ctgtgt!at gtatagggga aagaagggat ccctaactgt tcccttatct gttttctacc 60 

<210> 145 
<211> 983 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> NM_004335 
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<400> 145 

gtggaattca 

aagcgctgta 

ggggtgccct 

cgggcagtga 

cagaagggct 

ctaatggctt 

ggagagatca 

agaagagaaa 

caggactcca 

ctgctgcagt 

tcgcttgaac 

cggggagagc 

gcagtcctgg 

cctccggaca 

gtggggggca 

tggggtcttt 

aaaaaaaaaa 



tggcatctac 
agcttctgct 
tgattatctt 
tggagtgtcg 
ttcaggatgt 
ccctggatgc 
ctacattaaa 
accaggtctt 
gctccgctgc 
gagatcccag 
attcccttga 
acggggtagc 
gtgtggggac 
atgagtcccc 
tgtgctgcct 
gagctccaaa 
aaaaaaaaaa 



ttcgtatgac 
ggggatagga 
caccatcaag 
caatgtcacc 
ggaggcccag 
agagaaggcc 
ccataagctt 
aagcgtgaga 
ggcgccccag 
gaagctggca 
tctcatcagt 
cggagaaggg 
acagtcgggt 
cctcttgtct 
gttgttatgg 
aaataaacac 
aaa 983 



tattgcagag 
attctggtgc 
gccaacagcg 
catctcctgc 
gccgccacct 
caaggacaaa 
caggacgcgt 
atcgcggaca 
ctgctgattg 
catcttggaa 
tctgagcggg 
cctctggagc 
tgacccaggg 
cccaccctga 
gttttttttg 
ttcctttgag 



tgcccatgga 
tcctgatcat 
aggcctgccg 
aacaagagct 
gcaaccacac 
agaaagtgga 
ctgcagaggt 
agaagtacta 
tgctgctggg 
ggtccgtcct 
tcatggggca 
aggtctggag 
ctgtctccct 
gattgggcat 
cggggggggt 
ggagagcaaa 



agacggggat 
cgtgattctg 
ggacggcctt 
gaccgaggcc 
tgtgatggcc 
ggagcttgag 
ggagcgactg 
ccccagctcc 
cctcagcgct 
gctcggcttt 
acacggttag 
gggccatggg 
ccagagcctc 
ggggtgcggt 
tgcttttttc 
aaaaaaaaaa 
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<210> 146 
<211> 60 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> NM_004335 



<400> 146 

ggttgctttt ttctggggtc tttgagctcc aaaaaataaa cacttccttt gagggagagc 



60 



<210> 147 
<211> 3446 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> NM_004336 



<400> 147 

ttctagtttg 

ggaaaatgtc 

tggtgaatgg 

cttgataact 

tgacccaaga 

attttttgag 

ctgggcgggg 

aggaattcaa 

gacacgcctc 

tgttcaggtt 

catttctaag 

gtcaaatatg 

tttggcatcc 

tggggaatca 

gcatgagcaa 

tgaagaacag 

ggagacatcc 

tatggggcca 

tcagcagaca 

tttggcaaat 

tcctgttcct 

tgctggatgt 



cggttcaggt 
cttcagatgc 
gaaagataca 
ttactagaac 
ttcatcagtt 
tttctgtaca 
catctggaag 
aaccaggctg 
actgaaaccc 
ttaaatcaaa 
aatcagggtt 
gaacgaagag 
aaagttgatg 
gaattttcct 
tgggtaaatg 
ctattaaaac 
catgaggatc 
agtgtaggct 
ccagtgaaca 
gctatttctg 
ttgaaagccc 
gtgaataaga 



ttgccgctgc 
ttgaagccca 
tacagtgggt 
atttaatgaa 
attgtttaaa 
accatgggat 
cccaaggaga 
aacccagaga 
atttgccagc 
tgataacatc 
cagagctttc 
tgatcacgat 
ttgagcaggt 
ttgaagaatt 
aagacagaca 
agaaaatgga 
tgcccgcttc 
cccagcagga 
tggaaaagaa 
cagctttggt 
agacagtaac 
gtactcatga 



cggccagcgt 
catgcagagc 
agaagagaat 
ggaattttta 
atttgctgag 
tggaaccctg 
gctgcagcat 
gttcctgcaa 
tcaagctaga 
aaaatcaaat 
tggagtgata 
ttctaaatca 
tgttatgtat 
gagagcccag 
ttatatgaaa 
tgaacttcat 
ccaggaaagg 
actgagagcg 
cccaagagag 
gtccccagcc 
agactccatg 
attcaagcca 



cctctggcca 
tacaagggca 
ttfccctgaga 
gataagaaga 
tacaacagtg 
tcatcccctc 
gccagtgctg 
caacaataca 
acctcagaac 
ccaggaaata 
tcttcagctt 
gaatattctg 
tgcaaggaga 
aaatacaatc 
aggaaagaag 
aagaagttgc 
tccgaggtta 
ccatgtcttc 
gcacctcctg 
accagccaga 
tttgcagtgg 
cagagtggag 



tggacacccc 
atgaccctct 
ataaagaata 
aataccacaa 
acctccatca 
tgtacattgc 
tccttcagag 
ggttatttca 
ctctgcataa 
acatggcctg 
gtgataaaga 
tgcactcatc 
agcttattcg 
aacggagaaa 
caaatgcttt 
atcaggtggt 
atccagcacg 
cagtaaccta 
ttgttcctcc 
gcattgctcc 
ccagcaaaga 
cagagatcaa 
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agaagggtgt gaaacacata aggttgccaa 
atcactggga atggttcagg caacgccatc 
aaaagaagca ttaggtttca tcatgaatat 
tgatgacaaa gatgaatggc aatctctaga 
tcaaaaaaat gtaaggtcat ctggggcttg 
atctgctttt catgtgtttg aagatggaaa 
aaataaaccc acaggagcca ggacctttgg 
accaaaggag gaagtgcctc atgctgaaga 
tcgctgcaac aaaaccctgg cacccagtcc 
acaacttgcg tctacaccat tccacaagct 
taaagaaaat gtggtagcaa sacagtgtac 
catggtggtg ccttcaaggg atggaaaatt 
ggccttgtcg tctcacatgt attcagcatc 
tggggtactt acctgtgagg cagagttggg 
cgctgccatt gcagaagatc caccagatgc 
gatgagttca cttgggactg ttgatgctcc 
taagctgatt ttcaaacttt tatctgggct 
ttttgaatgg caatgtaaac ttccagccat 
taagctggtc tatgtccatc accttcttgg 
tacccaggga gatctgaatg atgctaaaaa 
gcctgccaac ccctgggaat tctacattgg 
tatgcagcac atgtttatga agttctattc 
agtaggagag ctctacagct atggaacatt 
ccctgaaaaa gtgatgcctc aaggtcttgt 
gattgagcaa gtgcatgact gtgaaatcat 
acttggaaac ggatttttgg aacaggatga 
gattgacctg ggtcagagta tagatatgaa 
aaagtgtgaa acatctggtt ttcagtgtgt 
ccagatcgat tactttgggg ttgctgcaac 
gaaagtgaaa aatgaaggag gagagtgtaa 
tttggatatg tggaatgaat tttttcatgt 
tccatctttg gatttgttaa ggcaaaagct 
caagattagg gccctacgta ataggctaat 
aaaataaaat ttggatatag acagtcctta 
ctttaaacct gttttttttt catttattgt 
atggaatatt tccatgtaaa aaaaaa 3 4- 



cacaagttct tttcacacaa ctccaaacac 13 80 

caaagtgcag ccatcaccca ccgtgcacac 1440 

gtttcaggct cctacacttc ctgatatttc 15 00 

tcaaaatgaa gatgcatttg aagcccagtt 15 60 

gggagtcaat aagatcatct cttctttgtc 162 0 

caaagaaaat tatggattac cacagcctaa 1680 

agaacgctct gtcagcagac ttccttcaaa 1740 

gtttttggat gactcaactg tatggggtat 1800 

taagagccca ggagacttca catctgctgc 1860 

tccagtggag tcagtgcaca ttttagaaga 192 0 

ccaggcgact ttggattctt gtgaggaaaa 1980 

cagtccaatt caagagaaaa gcccaaaaca 2 040 

cttacttcgt ctgagccagc ctgctgcagg 2100 

cgttgaggct tgcagactca cagacactga 2160 

tattgctggg ctccaagcag aatggatgca 2220 

aaacttcatt gttgggaacc catgggatga 22 80 

ttctaaacca gtgagttcct atccaaatac 2340 

caagcccaag actgaatttc aattgggttc 2 400 

agaaggagcc tttgcccagg tgtacgaagc 2460 

taaacagaaa tttgttttaa aggtccaaaa 2 520 

gacccagttg atggaaagac taaagccatc 2 5 80 

tgcccactta ttccagaatg gcagtgtatt 2 640 

attaaatgcc attaacctct ataaaaatac 27 00 

catctctttt gctatgagaa tgctttacat 2 7 60 

tcatggagac attaaaccag acaatttcat 2 820 

tgaagatgat ttatctgctg gcttggcact 2 880 

actttttcca aaaggaacta tattcacagc 2 940 

tgagatgctc agcaacaaac catggaacta 3 000 

agtatattgc atgctctttg gcacttacat 3 0 60 

gcctgaaggt ctttttagaa ggcttcctca 312 0 

tatgttgaat attccagatt gtcatcatct 3180 

gaagaaagta tttcaacaac actatactaa 3240 

tgtactgctc ttagaatgta agcgttcacg 3300 

aaaatcacac tgtaaatatg aatctgctca 33 60 

ttatgtaaat gtttgttaaa aataaatccc 342 0 
16 



<210> 148 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_004336 



<400> 148 

ttagggccct acgtaatagg ctaattgtac tgctcttaga atgtaagcgt tcacgaaaat 60 



<210> 149 
<211> 739 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> N1YL.004345 



<400> 149 

taaagcaaac cccagcccac accctggcag 
cctggttggg cttttgcatc aggctcaggc 
gggaggcaga catggggacc atgaagaccc 
cactggtgct cctgctgctg ggcctggtga 



gcagccaggg atgggtggat caggaaggct 60 

tgggcataaa ggaggctcct gtgggctaga 120 

aaagggatgg ccactccctg gggcggtggt 180 

tgcctctggc catcattgcc caggtcctca 240 
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gctacaagga agctgtgctt cgtgctatag atggcatcaa ccagcggtcc tcggatgcta 
acctctaccg cctcctggac ctggacccca ggcccacgat ggatggggac ccagacacgc 
caaagcctgt gagcttcaca gtgaaggaga cagtgtgccc caggacgaca cagcagtcac 
cagaggattg tgacttcaag aaggacgggc tggtgaagcg gtgtatgggg acagtgaccc 
tcaaccaggc caggggctcc tttgacatca gttgtgataa ggataacaag agatttgccc 
tgctgggtga tttcttccgg aaatctaaag agaagattgg caaagagttt aaaagaattg 
tccagagaat caaggatttt ttgcggaatc ttgtacccag gacagagtcc tagtgtgtgc 
cctaccctgg ctcaggcttc tgggctctga gaaataaact atgagagcaa tttcaaaaaa 
aaaaaaaaaa aaaaaaaaa 739 

<210> 150 
<211> 60 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> NM_004345 
<400> 150 

gcaaagagtt taaaagaatt gtccagagaa tcaaggattt tttgcggaat cttgtaccca 6 

<210> 151 
<211> 1432 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> NM_004577 
<400> 151 

gaggaaaatt cttccagcga tggtctccca ctcagagctg aggaagcttt tctactcagc 
agatgctgtg tgttttgatg ttgacagcac ggtcatcaga gaagaaggaa tcgatgagct 
agccaaaatc tgtggcgttg aggacgcggt gtcagaaatg acacggcgag ccatgggcgg 
ggcagtgcct ttcaaagctg ctctcacaga gcgcttagcc ctcatccagc cctccaggga 
gcaggtgcag agactcatag cagagcaacc cccacacctg acccccggca taagggagct 
ggtaagtcgc ctacaggagc gaaatgttca ggttttccta atatctggtg gctttaggag 
tattgtagag catgttgctt caaagctcaa tatcccagca accaatgtat ttgccaatag 
gctgaaattc tactttaacg gtgaatatgc aggttttgat gagacgcagc caacagctga 
atctggtgga aaaggaaaag tgattaaact tttaaaggaa aaatttcatt ttaagaaaat 
aatcatgatt ggagatggtg ccacagatat ggaagcctgt cctcctgctg atgctttcat 
tggatttgga ggaaatgtga tcaggcaaca agtcaaggat aacgccaaat ggtatatcac 
tgattttgta gagctgctgg gagaactgga agaataacat ccattgtcgt acagctccaa 
acaacttcag atgaattttt acaagttata cagattgata ctgtttgctt acagttgcct 
attacaactt gctatagaaa gttggtacaa atgatctgta ctttaaacta cagttaggaa 
tcctagaaga ttgctttttt ttttttttta actgtagttc cagtattata tgatgactat 
tgatttcctg gagaggtttt tttttttttt gagacagaat cttgctctgt tgcccaggct 
ggagtgcagt ggcgcggtct cggctcactg caagctctgc ctcccaggtt cacgccattc 
tcctgcctca gcctcccgag tagctgggac tacaggcacc cgccaccaca tccggctaat 
tttttgtatt tttagtagag acggggtttg accgtgttag ccaggatggt cttgatctcc 
tgaccttgtg atccgcctgc ctcagcctcc caaagtgctg ggattacagg cttgggccac 
cgcgcccagc caatgtccta gagagttttg tgatctgaat tctttatgta tatttgtagc 
tatatttcat acaaagtgct ttaagtgtgg agagtcaatt aaacaccttt actcttagaa 
atacggattc ggcagccttc agtgaatatt ggtttctctt tggtatgtca ataaaagttt 
atccgtatgt cagaacggat ttgtggaaaa aaaaaaaaaa aaaaaaaaaa aa 1432 

<210> 152 
<211> 60 
<212> DNA 
<213> Homo sapiens 

<300> 
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<308> NM_004577 
<400> 152 

tagaaatacg gattcggcag ccttcagtga atattggttt ctctttggta tgtcaataaa 6 0 

<210> 153 

<211> 1530 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_004701 
- <400> 153 

aatcctggaa caaggctaca gcgtcgaaga tccccagcgc tgcgggctcg gagagcagtc 6 0 

ctaacggcgc ctcgtacgct agtgtcctcc cttttcagtc cgcgtccctc cctgggccgg 12 0 

gctggcactc ttgccttccc cgtccctcat ggcgctgctc cgacgcccga cggtgtccag 180 

tgatttggag aatattgaca caggagttaa ttctaaagtt aagagtcatg tgactattag 240 

gcgaactgtt ttagaagaaa ttggaaatag agttacaacc agagcagcac aagtagctaa 3 00 

gaaagctcag aacaccaaag ttccagttca acccaccaaa acaacaaatg tcaacaaaca 3 60 

actgaaacct actgcttctg tcaaaccagt acagatggaa aagttggctc caaagggtcc 42 0 

ttctcccaca cctgaggatg tctccatgaa ggaagagaat ctctgccaag ctttttctga 480 

tgccttgctc tgcaaaatcg aggacattga taacgaagat tgggagaacc ctcagctctg 540 

cagtgactac gttaaggata tctatcagta tctcaggcag ctggaggttt tgcagtccat 600 

aaacccacat ttcttagatg gaagagatat aaatggacgc atgcgtgcca tcctagtgga 660 

ttggctggta caagtccact ccaagtttag gcttctgcag gagactctgt acatgtgcgt 72 0 

tggcattatg gatcgatttt tacaggttca gccagtttcc cggaagaagc ttcaattagt 780 

tgggattact gctctgctct tggcttccaa gtatgaggag atgttttctc caaatattga 840 

agactttgtt tacatcacag acaatgctta taccagttcc caaatccgag aaatggaaac 900 

tctaattttg aaagaattga aatttgagtt gggtcgaccc ttgccactac acttcttaag 960 

gcgagcatca aaagccgggg aggttgatgt tgaacagcac actttagcca agtatttgat 102 0 

ggagctgact ctcatcgact atgatatggt gcattatcat ccttctaagg tagcagcagc 1080 

tgcttcctgc ttgtctcaga aggttctagg acaaggaaaa tggaacttaa agcagcagta 1140 

ttacacagga tacacagaga atgaagtatt ggaagtcatg cagcacatgg ccaagaatgt 12 00 

ggtgaaagta aatgaaaact taactaaatt catcgccatc aagaataagt atgcaagcag 12 60 

caaactcctg aagatcagca tgatccctca gctgaactca aaagccgtca aagaccttgc 132 0 

ctccccactg ataggaaggt cctaggctgc cgtgggccct ggggatgtgt gcttcattgt 13 8 0 

gccctttttc ttattggttt agaactcttg attttgtaca tagtcctctg gtctatctca 1440 

tgaaacctct tctcagacca gttttctaaa catatattga ggaaaaataa agcgattggt 15 00 
ttttcttaag gtaaaaaaaa aaaaaaaaaa 1530 

<210> 154 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_004701 
<400> 154 

agaactcttg attttgtaca tagtcctctg gtctatctca tgaaacctct tctcagacca 60 

<210> 155 

<211> 2536 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM„004702 
<400> 155 
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agcgggtgcg gggcgggacc ggcccggcct 
gccgagcggt agctggtctg gcgaggtttt 
aagtagccgt ttacaagcta agcagcagcc 
agcccagata atccaggcca agaagaggaa 
taaacatgtt aaaaaaggag agcagatatg 
ctgacttgga accacagatg aggtccatac 
tatacacact tcatagggaa acattttatc 
tgacacaaaa ggatataaat aaaaatatgc 
ttgcttccaa acttgaggaa atctatgctc 
atggtgcttg cagtgaagag gatatcttaa 
aatgggaact ttgtcctgta acaatcatct 
ctcttaaaga tgctcctaaa gttcttctac 
tagctcagct tttagatctg tgtattctag 
tactgactgc tgctgccttg tgccatttta 
gtttggagtg ggacagtatt tcagaatgtg 
taaaaagtac tagtccagtg aagctgaaga 
ataatatcca gacacataca aactatttgg 
ccttcagaaa agggggacag ttgtcaccag 
agagcactga aaaaccacca ggaaaacact 
ttcaccaaga ttgggtagaa ctggtatcac 
ctgtgattga ttgccctagc caattcacaa 
caattggcac taaagaatac atttaattat 
gacttgttta caaagatgtc ttcattccca 
ataccatagc aatgtttttc ctttaatcca 
aattttgtca ctggagtttt ggactggata 
acagtacttt gaatctagtt gttagattct 
ttggttcttg aaaattaaat ttaaacttgt 
ctaatttatc tatagctgct atagcaagct 
aaatttaatg ttttttaaac tagtttattt 
aggcttagat gaacatggtg ttcaacctgt 
ataaacaaga taaatgctgt ggctccttcc 
ttggtaataa cctttttgta tatcacaatt 
tatttatatg aggaagtcac tttactactc 
atttagtgtg actaaggctt tatttatgtt 
ctccattgtg agataaggac agtgtcaaag 
tattttctta aggaagaaga gtattaaata 
aaaaagacat gaaaacttgc tgtacatagg 
tttctaaaaa aaaaatccag cctcataaag 
ttgttatcag atatgtgaat ctcttctccc 
tatgaagtct tctgtatagt ttgtttttaa 
aagaaaacac cactaattgt gtacatatgt 
tatttttagc ccattgttta aaaaataaaa 
aaaaaaaaaa aaaaaa 2 53 6 



atatattggg 
atacacctga 
ccagcccagc 
aactacccag 
ttcatgacaa 
ttctagactg 
ttgcacaaga 
ttcaacfccat 
ctaaactcca 
ggatggaact 
cctggctaaa 
ctcagtattc 
ccattgattc 
cctccattga 
tagattggat 
cttttaagaa 
ctatgctgga 
tgtgcaatgg 
aaagaagata 
tgaactacta 
gttacactgc 
ttcctatgtt 
aggttactgg 
gtgttactgt 
agtgctacct 
caaaattcct 
ttacaaaggt 
attataaaac 
gccttgccat 
gctctaaaca 
taactggggc 
tgggtgaaaa 
taagatatcc 
tgtgaaactg 
tgataaagct 
tatactgact 
ctagctattt 
tagattagaa 
tttgaagaaa 
actaatattt 
attatataaa 
gttaaaaaaa 



ttggcgccgg 
aagaagagaa 
cagacggaat 
gatgtcaaaa 
acattttgaa 
gcttttagag 
cttttttgat 
tggaattacc 
agagtttgct 
cattatatta 
tctctttctc 
tcaggaaaca 
attagagttc 
agtggttaag 
ggtacctttt 
gattcctatg 
ggaagtaaat 
aggcattatg 
actaagcaaa 
aagttttaca 
cattctgatt 
agctgttaaa 
atagaagcca 
gtttatcttg 
taaagggtat 
acactcttga 
ttagttttgt 
ttgaatttct 
aacacatttt 
gtgggagtac 
tttcttgaca 
acttaagtac 
ctaaggaatt 
ttaaggtcct 
taacacttga 
cctagaaatc 
ctaaatattt 
aactagattg 
ctatacattt 
gtttcagtat 
cttaaccttt 
tttaactgct 



cgccagctga 
tgtcaagacg 
ccccccaaga 
gaagtctggc 
gttctgcatt 
gtatgtgaag 
agatttatgt 
tcattattca 
tacgtcactg 
aaggctttaa 
caagttgatg 
ttcattcaaa 
cagtacagaa 
aaagcctcag 
gtcaatgtag 
gaagacagac 
tacataaaca 
acaccaccga 
caagttggaa 
gaaagtagtg 
ttaaaactta 
gaaacagcag 
accacagtct 
ataaactagg 
actaagtgat 
ctagtgcaat 
aataaggtga 
acaaatggtg 
ttaactaata 
caaagaaatt 
tgtaggttgc 
cctttcaaac 
ttttttttta 
ttctaaattc 
cctaaacttc 
tatttattaa 
taaattagct 
ctagtttatt 
attgttacgg 
tttgtctgaa 
taatactgtt 
taaaagtaaa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 



<210> 156 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_004702 

<400> 156 ' *:n 

gtttgtgaaa ctgttaaggt cctttctaaa ttcctccatt gtgagataag gacagtgtca 6U 

<210> 157 

<211> 1491 

<212> DNA 

<213> Homo sapiens 



<300> 
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<308> NM_004710 
<400> 157 

gcggcggcgg cagcggcggc gacggcgaca tggagagcgg ggcctacggc gcggccaagg 60 

cgggcggctc cttcgacctg cggcgcttcc tgacgcagcc gcaggtggtg gcgcgcgccg 12 0 

tgtgcttggt cttcgccttg atcgtgttct cctgcatcta tggtgagggc tacagcaatg 180 

cccacgagtc taagcagatg tactgcgtgt tcaaccgcaa cgaggatgcc tgccgctatg 240 

gcagtgccat cggggtgctg gccttcctgg cctcggcctt cttcttggtg gtcgacgcgt 3 00 

atttccccca gatcagcaac gccactgacc gcaagtacct ggtcattggt gacctgctct 3 60 

tctcagctct ctggaccttc ctgtggtttg ttggtttctg cttcctcacc aaccagtggg 420 

cagtcaccaa cccgaaggac gtgctggtgg gggccgactc tgtgagggca gccatcacct 480 

tcagcttctt ttccatcttc tcctggggtg tgctggcctc cctggcctac cagcgctaca 540 

aggctggcgt ggacgacttc atccagaatt acgttgaccc cactccggac cccaacactg 600 

cctacgcctc ctacccaggt gcatctgtgg acaactacca acagccaccc ttcacccaga 660 

acgcggagac caccgagggc taccagccgc cccctgtgta ctgagcggcg gttagcgtgg 72 0 

gaagggggac agagagggcc ctcccctctg ccctggactt tcccatgagc ctcctggaac 7 80 

tgccagcccc tctctttcac ctgttccatc ctgtgcagct gacacacagc taaggagcct 840 

catagcctgg cgggggctgg cagagccaca ccccaagtgc ctgtgcccag agggcttcag 9 00 

tcagccgctc actcctccag ggcattttta ggaaagggtt ttcagctagt gtttttcctc 960 

gcttttaatg acctcagccc cgcctgcagt ggctagaagc cagcaggtgc ccatgtgcta 102 0 

ctgacaagtg cctcagcttc cccccggccc gggtcaggcc gtgggagccg ctattatctg 1080 

cgttctctgc caaagactcg tgggggccat cacacctgcc ctgtgcagcg gagccggacc 114 0 

aggctcttgt gtcctcactc aggtttgctt cccctgtgcc cactgctgta tgatctgggg 120 0 

gccaccaccc tgtgccggtg gcctctgggc tgcctcccgt ggtgtgaggg cggggctggt 1260 

gctcatggca cttcctcctt gctcccaccc ctggcagcag ggaagggctt tgcctgacaa 132 0 

cacccagctt tatgtaaata ttctgcagtt gttacttagg aagcctgggg agggcagggg 13 80 

tgccccatgg ctcccagact ctgtctgtgc cgagtgtatt ataaaatcgt gggggagatg 144 0 
cccggcctgg gatgctgttt ggagacggaa taaatgtttt ctcattcagt a 1491 

<210> 158 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_004710 
<400> 158 

ttgcctgaca acacccagct ttatgtaaat attctgcagt tgttacttag gaagcctggg 6 0 

<210> 159 

<211> 3324 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_004856 
<400> 159 

gcagagcacc gcgccttagc cgcgaagttc tagttcttgc tgccggtcct aacgtcccgc 60 

agtcttcgcc agccagccgt cccgcatgcg cgtttgggcg gcgtggagcc tgctgccatg 12 0 

aagtcagcga gagctaagac accccggaaa cctaccgtga aaaaagggtc ccaaacgaac 180 

cttaaagacc cagttggggt atactgtagg gtgcgcccac tgggctttcc tgatcaagag 240 

tgttgcatag aagtgatcaa taatacaact gttcagcttc atactcctga gggctacaga 3 00 

ctcaaccgaa atggagacta taaggagact cagtattcat ttaaacaagt atttggcact 3 60 

cacaccaccc agaaggaact ctttgatgtt gtggctaatc ccttggtcaa tgacctcatt 420 

catggcaaaa atggtcttct ttttacatat ggtgtgacgg gaagtggaaa aactcacaca 480 

atgactggtt ctccagggga aggagggctg cttcctcgtt gtttggacat gatctttaac 540 

agtatagggt catttcaagc taaacgatat gttttcaaat ctaatgatag gaatagtatg 600 

gatatacagt gtgaggttga tgccttatta gaacgtcaga aaagagaagc tatgcccaat 660 

ccaaagactt cttctagcaa acgacaagta gatccagagt ttgcagatat: gataactgta 720 
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caagaattct 

tatattgaaa 

ataaaaccca 

gttgcaggat 

agaggccaga 

catagcgtgt 

ttacaggaaa 

gaaagaacta 

cagtcactaa 

actaacaaga 

gatggggaag 

gaaaacttgc 

gtagacaagg 

ccagttggaa 

tcatgcgaaa 

gagaaacgac 

tttaaagctt 

gggaaactaa 

gaaaagaaaa 

tatgaggaag 

cgacagtttt 

acaatgaagt 

aataaactct 

aaaactgaga 

tctgtttctc 

cgtctccgac 

tctaacatgc 

gttgcaaatg 

gcctccgatg 

ggtggacaat 

agtcgaaaac 

tggaccgatg 

ggacctggat 

agtactgaaa 

tcttccaggt 

atacactgac 

ccttgtatag 

catcactgta 

ctaattcaca 

attttttatt 

ctgctattta 

ataaagatgt 

tgtatatatt 

gaaataaaaa 



gcaaagcaga 

tatataataa 

aacctccaca 

gtacagaagt 

aaaagagacg 

tcaacattaa 

aagaacaaat 

accggaccag 

tgacgctaag 

tggttccata 

gaaaagtgcg 

aagtcatgag 

caatatgtgg 

atgaaccatt 

ttttggatat 

ataacttacg 

tgttacaaga 

atgaaaagga 

acaaaacttt 

ataaacgcaa 

ctgacaaacg 

gggagaaaga 

gggttaaaga 

agccagagag 

catcacctgt 

acagacgatc 

aaactgaaac 

aaaaggcact 

gggagattga 

ctgttcagtt 

gaagatcttc 

tagaaacaag 

atcagcatca 

gaacattttc 

catcttgtag 

ccagagcaaa 

tatatgfcttt 

tgaatttttt 

caagtgtttg 

gtgacttctc 

ttagctgcaa 

aaatgactgg 

cactttctga 

aaaaaaaaaa 



agaggttgat 

ttacatatat 

atctaaattg 

tgaagtgaaa 

tattgctaat 

attagttcag 

cactataagt 

agcagaaggg 

aacatgtatg 

tcgagattca 

gatgatcgtg 

atttgcggaa 

tttaacgcct 

ggttactgac 

caacgatgag 

acaaatgatg 

atttgacaat 

gaagatgatc 

agaatataag 

tttgcaacag 

cagattagaa 

atgtgagcgt 

tgaaaagctg 

accctctcgg 

gcctttactc 

acgctctgca 

agtcatgcag 

agctaagtgt 

aactaaacta 

tactgatatt 

cacagtagca 

gtgttctgtg 

cgcacaaccc 

atttgtgtgg 

aactccagct 

gctttcccta 

gccatattta 

ataatgtttt 

tcttagatga 

cagccctggt 

gaatgcactt 

ccaataataa 

catttagata 

aaaa 33 2 4 



gaagatagtg 

gatctattgg 

cttcgtgaag 

tctactgagg 

acccatttga 

gctcccttgg 

cagttgtcct 

aacagattac 

gatgtcctaa 

aagttaaccc 

tgtgtgaacc 

gtgactcaag 

gggaggagat 

gtggttttgc 

cagacacttc 

attgatgagt 

gctgttttaa 

tcaggacaga 

attgagattt 

gaacttgaaa 

gccaggttgc 

agagtggcag 

aaacaactga 

gagcgagatc 

tttcaacctg 

ggagacagat 

ccacatgtcc 

gagaagtaca 

attaagggtg 

gagactttaa 

cctgcccaac 

gctgtggaga 

aagcgcaaaa 

atgatttctc 

ttgttgaaaa 

tggttccaaa 

atattaatag 

tttaaaatat 

ttaaggaaga 

ctgaatttct 

tagaactatt 

ccattttagg 

tgccaaaaga 



tctatggtgt 

aagaggtgcc 

ataagaacca 

aggcttttga 

atcgtgagtc 

atgcagatgg 

tggtagatct 

gtgaagctgg 

gagagaacca 

atctgttcaa 

ccaaggctga 

aagttgaagt 

acagaaacca 

agagttttcc 

caaggctgat 

ttaacaaaca 

gtaaagaaaa 

aattggaaat 

tagagaaaac 

ctcagaacca 

aaggcatggt 

ccaaacagct 

aggctattgt 

gagaaaaagt 

atcagaacgc 

gggtagatca 

ctcatgccat 

tgctgaccca 

atatttataa 

agcaagaatc 

cagatggtgc 

tgagagcagg 

agccatgaac 

gaaagccatg 

tcacggacct 

gacaactagt 

cagaggaaga 

atttcatgta 

ctatatctag 

taaggtttta 

tgacaattca 

aaggtgtttt 

attaaaatca 



atttgtctct 

gtttgatccc 

taacatgtat 

agttttctgg 

cagccgttcc 

agacaatgtc 

tgctggaagt 

taatattaat 

aatgtatgga 

gaactacttt 

agattatgaa 

agcaagacct 

gcctcgaggt 

acctttgccg 

tgaagcctta 

atctaatgct 

ccacatgcaa 

agaacgactg 

aactactatc 

gaaacttcag 

gacagaaacg 

ggagatgcag 

tactgaacct 

tactcaaaga 

accaccaatt 

taagcccgcc 

cacagtatct 

ccaggaacta 

aacaaggggt 

accaaatggt 

agagtctgaa 

atcccagctg 

tgacagtccc 

ccagaagcag 

cagctacatc 

attcaacaaa 

ctcctttttt 

tacttataaa 

atcatgtctg 

taaacaaatg 

gactttcaaa 

gaattctgta 

aaagcactaa 



780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 



<210> 160 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_004856 



<400> 160 

caaagctttc cctatggttc aaagacaact agtattcaac aaaccttgta tagtgtatgt 



60 



<210> 161 

<211> 1536 

<212> DNA 

<213> Homo sapiens 
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<300> 

<308> NM_004900 
<400> 161 

acagagcttc aaaaaaagag cgggacaggg acaagcgtat ctaagaggct gaacatgaat 60 

ccacagatca gaaatccgat ggagcggatg tatcgagaca cattctacga caactttgaa 120 

aacgaaccca tcctctatgg tcggagctac acttggctgt gctatgaagt gaaaataaag 180 

aggggccgct caaatctcct ttgggacaca ggggtctttc gaggccaggt gtatttcaag 2 40 

cctcagtacc acgcagaaat gtgcttcctc tcttggttct gtggcaacca gctgcctgct 3 00 

tacaagtgtt tccagatcac ctggtttgta tcctggaccc cctgcccgga ctgtgtggcg 3 60 

aagctggccg aattcctgtc tgagcacccc aatgtcaccc tgaccatctc tgccgcccgc 420 

ctctactact actgggaaag agattaccga agggcgctct gcaggctgag tcaggcagga 480 

gcccgcgtga cgatcatgga ctatgaagaa tttgcatact gctgggaaaa ctttgtgtac 540 

aatgaaggtc agcaattcat gccttggtac aaattcgatg aaaattatgc attcctgcac 600 

cgcacgctaa aggagattct cagatacctg atggatccag acacattcac tttcaacttt 6 60 

aataatgacc ctttggtcct tcgacggcgc cagacctact tgtgctatga ggtggagcgc 720 

ctggacaatg gcacctgggt cctgatggac cagcacatgg gctttctatg caacgaggct 7 80 

aagaatcttc tctgtggctt ttacggccgc catgcggagc tgcgcttctt ggacctggtt 840 

ccttctttgc agttggaccc ggcccagatc tacagggtca cttggttcat ctcctggagc 900 

ccc tgcttct cctggggctg tgccggggaa gtgcgtgcgt tccttcagga gaacacacac 960 

gtgagactgc gcatcttcgc tgcccgcatc tatgattacg accccctata taaggaggcg 102 0 

ctgcaaatgc tgcgggatgc tggggcccaa gtctccatca tgacctacga tgagtttgag 1080 

tactgctggg acacctttgt gtaccgccag ggatgtccct tccagccctg ggatggacta 1140 

gaggagcaca gccaagccct gagtgggagg ctgcgggcca ttctccagaa tcagggaaac 1200 

tgaaggatgg gcctcagtct ctaaggaagg cagagacctg ggttgagcag cagaataaaa 126 0 

gatcttcttc caagaaatgc aaacagaccg ttcaccacca tctccagctg ctcacagaca 132 0 

ccagcaaagc aatgtgctcc tgatcaagta gattttttaa aaatcagagt caattaattt 1380 

taattgaaaa tttctcttat gttccaagtg tacaagagta agattatgct caatattccc 1440 

agaatagttt tcaatgtatt aatgaagtga ttaattggct ccatatttag actaataaaa 150 0 
cattaagaat cttccataat tgtttccaca aacact 153 6 

<210> 162 
<211> 60 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> NM_004900 
<400> 162 

tgctcacaga caccagcaaa gcaatgtgct cctgatcaag tagatttttt aaaaatcaga 6 0 

<210> 163 

<211> 1722 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_004988 
<400> 163 

cgtagagttc ggccgaagga acctgaccca ggctctgtga ggaggcaagg ttttcagggg 60 

acaggccaac ccagaggaca ggattccctg gaggccacag aggagcacca aggagaagat 12 0 

ctgcctgtgg gtcttcattg cccagctcct gcccacactc ctgcctgctg ccctgacgag 180 

agtcatcatg tctcttgagc agaggagtct gcactgcaag cctgaggaag cccttgaggc 240 

ccaacaagag gccctgggcc tggtgtgtgt gcaggctgcc gcctcctcct cctctcctct 3 00 

ggtcctgggc accctggagg aggtgcccac tgctgggtca acagatcctc cccagagtcc 3 60 

tcagggagcc tccgcctttc ccactaccat caacttcact cgacagaggc aacccagtga 420 

gggttccagc agccgtgaag aggaggggcc aagcacctct tgtatcctgg agtccttgtt 480 

ccgagcagta atcactaaga aggtggctga tttggttggt tttctgctcc tcaaatatcg 540 

agccagggag ccagtcacaa aggcagaaat gctggagagt gtcatcaaaa attacaagca 600 
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ctgttttcct gagatcttcg gcaaagcctc tgagtccttg cagctggtct ttggcattga 660 

cgtgaaggaa gcagacccca ccggccactc ctatgtcctt gtcacctgcc taggtctctc 72 0 

ctatgatggc ctgctgggtg ataatcagat catgcccaag acaggcttcc tgataattgt 7 80 

cctggtcatg attgcaatgg agggcggcca tgctcctgag gaggaaatct gggaggagct 840 

gagtgtgatg gaggtgtatg atgggaggga gcacagtgcc tatggggagc ccaggaagct 900 

gctcacccaa gatttggtgc aggaaaagta cctggagtac cggcaggtgc cggacagtga 9 60 

tcccgcacgc tatgagttcc tgtggggtcc aagggccctt gctgaaacca gctatgtgaa 1020 

agtccttgag tatgtgatca aggtcagtgc aagagttcgc tttttcttcc catccctgcg 1080 

tgaagcagct ttgagagagg aggaagaggg agtctgagca tgagttgcag ccagggccag 1140 

tgggaggggg actgggccag tgcaccttcc agggccgcgt ccagcagctt cccctgcctc 12 00 

gtgtgacatg aggcccattc ttcactctga agagagcggt cagtgttctc agtagtaggt 12 60 

ttctgttcta ttgggtgact tggagattta tctttgttct cttttggaat tgttcaaatg 1320 

ttttttttta agggatggtt gaatgaactt cagcatccaa gtttatgaat gacagcagtc 13 80 

acacagttct gtgtatatag tttaagggta agagtcttgt gttttattca gattgggaaa 1440 

tccattctat tttgtgaatt gggataataa cagcagtgga ataagtactt agaaatgtga 1500 

aaaatgagca gtaaaataga tgagataaag aactaaagaa attaagagat agtcaattct 1560 

tgccttatac ctcagtctat tctgtaaaat ttttaaagat atatgcatac ctggatttcc 162 0 

ttggcttctt tgagaatgta agagaaatta aatctgaata aagaattctt cctgttaaaa 1680 
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aa 1722 

<210> 164 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_004988 
<400> 164 

cagattggga aatccattct attttgtgaa ttgggataat aacagcagtg gaataagtac 60 

<210> 165 

<211> 2334 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_004994 
<400> 165 

agacacctct gccctcacca tgagcctctg gcagcccctg gtcctggtgc tcctggtgct 60 

gggctgctgc tttgctgccc ccagacagcg ccagtccacc cttgtgctct tccctggaga 12 0 

cctgagaacc aatctcaccg acaggcagct ggcagaggaa tacctgtacc gctatggtta 180 

cactcgggtg gcagagatgc gtggagagtc gaaatctctg gggcctgcgc tgctgcttct 2 40 

ccagaagcaa ctgtccctgc ccgagaccgg tgagctggat agcgccacgc tgaaggccat 3 00 

gcgaacccca cggtgcgggg tcccagacct gggcagattc caaacctttg agggcgacct 3 60 

caagtggcac caccacaaca tcacctattg gatccaaaac tactcggaag acttgccgcg 42 0 

ggcggtgatt gacgacgcct ttgcccgcgc cttcgcactg tggagcgcgg tgacgccgct 480 

caccttcact cgcgtgtaca gccgggacgc agacatcgtc atccagtttg gtgtcgcgga 540 

gcacggagac gggtatccct tcgacgggaa ggacgggctc ctggcacacg cctttcctcc 600 

tggccccggc attcagggag acgcccattt cgacgatgac gagttgtggt ccctgggcaa 660 

gggcgtcgtg gttccaactc ggtttggaaa cgcagatggc gcggcctgcc acttcccctt 720 

catcttcgag ggccgctcct actctgcctg caccaccgac ggtcgctccg acggcttgcc 7 80 

ctggtgcagt accacggcca actacgacac cgacgaccgg tttggcttct gccccagcga 840 

gagactctac acccgggacg gcaatgctga tgggaaaccc tgccagtttc cattcatctt 900 

ccaaggccaa tcctactccg cctgcaccac ggacggtcgc tccgacggct accgctggtg 960 

cgccaccacc gccaactacg accgggacaa gctcttcggc ttctgcccga cccgagctga 1020 

ctcgacggtg atggggggca actcggcggg ggagctgtgc gtcttcccct tcactttcct 1080 

gggtaaggag tactcgacct gtaccagcga gggccgcgga gatgggcgcc tctggtgcgc 1140 

taccacctcg aactttgaca gcgacaagaa gtggggcttc tgcccggacc aaggatacag 1200 



75 



WO 2005/086891 



PCT/US2005/007894 



tttgttcctc gtggcggcgc atgagttcgg ccacgcgctg ggcttagatc attcctcagt 12 60 

gccggaggcg ctcatgtacc ctatgtaccg cttcactgag gggcccccct tgcataagga 1320 

cgacgtgaat ggcatccggc acctctatgg tcctcgccct gaacctgagc cacggcctcc 13 80 

aaccaccacc acaccgcagc ccacggctcc cccgacggtc tgccccaccg gaccccccac 1440 

tgtccacccc tcagagcgcc ccacagctgg ccccacaggt cccccctcag ctggccccac 1500 

aggtcccccc actgctggcc cttctacggc cactactgtg cctttgagtc cggtggacga 15 60 

tgcctgcaac gtgaacatct tcgacgccat cgcggagatt gggaaccagc tgtatttgtt 1620 

caaggatggg aagtactggc gattctctga gggcaggggg agccggccgc agggcccctt 1680 

ccttatcgcc gacaagtggc ccgcgctgcc ccgcaagctg gactcggtct ttgaggagcc 1740 

gctctccaag aagcttttct tcttctctgg gcgccaggtg tgggtgtaca caggcgcgtc 1800 

ggtgctgggc ccgaggcgtc tggacaagct gggcctggga gccgacgtgg cccaggtgac 1860 

cggggccctc cggagtggca gggggaagat gctgctgttc agcgggcggc gcctctggag 1920 

gttcgacgtg aaggcgcaga tggtggatcc ccggagcgcc agcgaggtgg accggatgtt 1980 

ccccggggtg cctttggaca cgcacgacgt cttccagtac cgagagaaag cctatttctg 2 040 

ccaggaccgc ttctactggc gcgtgagttc ccggagtgag ttgaaccagg tggaccaagt 2100 

gggctacgtg acctatgaca tcctgcagtg ccctgaggac tagggctccc gtcctgcttt 2160 

gcagtgccat gtaaatcccc actgggacca accctgggga aggagccagt ttgccggata 2220 

caaactggta ttctgttctg gaggaaaggg aggagtggag gtgggctggg ccctctcttc 22 80 
tcacctttgt tttttgttgg agtgtttcta ataaacttgg attctctaac cttt 2334 

<210> 166 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_004994 
<400> 166 

ggccctctct tctcaccttt gttttttgtt ggagtgtttc taataaactt ggattctcta 6 0 

<210> 167 

<211> 5329 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> Modif ied„base 
<222> 1 ... 5329 
<223> n = a.,c,g, or t 

<300> 

<308> NM_005063 
<400> 167 

gtggtgtcgg tgtcggcagc atccccggcg ccctgctgcg gtcgccggag ccctcggcct 60 

ctgttctcct ccccctcccg cccttacctc cacgcgggac cgcccgcgcc agtcaactcc 12 0 

tcgcactttg cccctgcttg gcagcggata aaagggggct gaggaaatac cggacacgtc 180 

cacccgttgc cagctctagc ctttaaattc ccggctcggg acctccacgc accgggctag 2 40 

cgccgacaac cagctagcgt gcaaggcgcc gcggctcagc gcgtaccggc gggcttcgaa 3 00 

accgcagtcc tccggcgacc ccgaactccg ctccggagcc tcagccccct ggaaagtgat 3 60 

cccggcatcg gagagccaag atgccggccc acttgctgca ggacgatatc tctagctcct 42 0 

ataccaccac caccaccatt acagcgcctc cc tccagggt cctgcagaat ggaggagata 480 

agttggagac gafcgcccctc tacttggaag acgacattcg ccctgatata aaagatgata 540 

tatatgaccc cacctacaag gataaggaag gcccaagccc caaggttgaa tatgtctgga 60 0 

gaaacatcat ccttatgtct ctgctacact tgggagccct gtatgggatc actttgattc 660 

ctacctgcaa gttctacacc tggctttggg gggtattcta ctattttgtc agtgccctgg 72 0 

gcataacagc aggagctcat cgtctgtgga gccaccgctc ttacaaagct cggctgcccc 7 80 

tacggctctt tctgatcatt gccaacacaa tggcattcca gaatgatgtc tatgaatggg 840 

ctcgtgacca ccgtgcccac cacaagtttt cagaaacaca tgctgatcct cataattccc 900 
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gacgtggctt 
aagagaaggg 
agaggaggta 
cctggtattt 
atgctgtggt 
gtccttatga 
gtgagggctt 
gctggcacat 
atgaccggaa 
gaaactacaa 
gcagaggttt 
atgatgttaa 
aactctgcct 
ggcccattgt 
gcagctggtc 
ctaaaacgaa 
agtaggtggc 
ttcagcccag 
aaagcnaggt 
cgaaggaacc 
ccaccacata 
ggcaatgcta 
taagtttgta 
taacaaggag 
tggtaaaaac 
tcctctttct 
ctgtggcatt 
aaataaaata 
ttccaaagag 
atttcctctc 
ttctaagagc 
tactgagtga 
tccagtttct 
ttcactcatt 
aactgcctgg 
taccctgtct 
ccctggacag 
agctccctcc 
agagctagct 
cctgccacac 
agaaagcctc 
ggccagccac 
agggcatgct 
cagctggcag 
gggcaagaac 
ctgcaaagct 
tctgttcagc 
tggaaaaaaa 
catcaaagca 
cagtccactg 
gtagaggcta 
agtagaaaag 
cccttggaaa 
tatataaaca 
atctgtacca 
gtgtggctgt 
tctaaagctc 
tttctaacag 
tttctaatca 
cctgagaccc 
cacgatttgc 



tttcttctct 
gagtacgcta 
ctacaaacct 
ctggggtgaa 
gcttaatgcc 
caagaacatt 
ccacaactac 
caacttcacc 
gaaagtctcc 
gagtggctga 
taatgtctgt 
cccattccag 
ttatgatgct 
cctccttttc 
agtctttgct 
tggtctttgc 
ttgagctaga 
gcttttgcta 
aaattgtcgg 
tctcgccatg 
gcatgcttcc 
attcaatgcc 
gtaaaagtgg 
atttcttagt 
agcagctcat 
gctctgccat 
tccaaacatc 
tatatacata 
ggatgtttgg 
cactgctgga 
gaacttcaca 
gttattggga 
ccttgaaatg 
tttttttgag 
ggggcagggt 
gtcccttttc 
gcagcctcct 
tgcacacaga 
gccactttca 
tgatggctca 
tctctccctc 
agagcaaaag 
gaatgccccc 
tgcaatgctt 
ccaagtgcct 
ttggctgaga 
aaactaacca 
caaaaacaga 
gaagtctggc 
ttgagcccca 
caggggttag 
gatctaaggg 
tgtctgctgg 
ataaaaacaa 
tgaccctaca 
gggtgtgggt 
aattcaagtg 
tccttgcttt 
cagtgttgca 
tactgaagtg 
aggattccct 



cacgtgggtt 
gacttgtctg 
ggcttgctgc 
acttttcaaa 
acctggctgg 
agcccccggg 
caccactcct 
acattcttca 
aaggccgcca 
gtttggggtc 
ttattaacta 
tacagtattc 
aagctgatat 
actttaatcg 
cagtgtccag 
tccagataac 
gataaaacag 
gatggaatgg 
gggagagagt 
atcagacata 
tttctctcct 
gcaacatata 
tctctgctgg 
tcatatatca 
agaattttga 
cttcaggata 
caaaaaaagg 
tatacattgc 
aaaaaactct 
catgagatgg 
taggaaggga 
gtcttaataa 
agtaaaaact 
gagaaggggg 
taggaatctc 
tttgaccaga 
ttgtgtgtat 
atgctcaggg 
cgtggcctcc 
agacaaggct 
cctctctcat 
agggtttatt 
tgcttacttg 
gtagaagtag 
cacctcgaaa 
cctgggattt 
gcattcccta 
atttgagaac 
tttgctctat 
gtgctggaag 
cctggactaa 
aagatcactg 
tatttctaat 
ctttcacttc 
taaggctgga 
gggagtgtgt 
acacattaat 

ggggggtgtg 
tatgagcctg 
gctggtagaa 
tctgggcttc 



ggctgcttgt 
acctagaagc 
tgatgtgctt 
acagtgtgtt 
tgaacagtgc 
agaatatcct 
ttccctatga 
ttgattgcat 
tcttggccag 
cctcaggttc 
ctgaataatg 
ttttaaaatt 
tatttcttct 
ccctcctttc 
cttccaaagc 
tctctttcct 
aatcttctgg 
aaaagcaact 
tagcatgtat 
cagctgccta 
ggctcggggt 
gttgaggccg 
ggaagggttt 
agaagtcttg 
gtattccatg 
ttggttcttc 
gaaggattta 
ttagaacgtt 
gaaggagagg 
agaggctgag 
tctgagaaca 
actagatatt 
agaaggcttc 
tctctgttaa 
ttcactaccc 
tctttctctt 
tcagaggcag 
tcactgaacc 
gcagtgtctc 
ggcaaaccct 
gagaagccaa 
ttcagtcccc 
gtgagggtgc 
gaggaaacag 
ggaggccctg 
gagataccac 
cagcctaggg 
cttggaccac 
taagattgga 
ggaggaaggc 
aggcatcctt 
tagtttagtt 
tccacaggtc 
ttcctattgt 
tggcacctca 
ctgctgagta 
gataaactca 
ctggcaactt 
ccctcactcc 
aaaggggcct 
attctggaaa 



gcgcaaacac 
tgagaaactg 
catcctgccc 
cgttgccact 
tgcccacctc 
ggtttcactt 
ctactctgcc 
ggccgccctc 
gattaaaaga 
ctttttcaaa 
ctaccaggat 
caaaagtatt 
cttatcctct 
ccttattgcc 
ctagacaacc 
tgagctgttg 
gtagtcccct 
tcatttgaca 
gaatgtaagg 
cctaatgagg 
aaaaagtggc 
aggataaaga 
tcttfctcttt 
aagttgggtg 
agctgctcat 
ccctcatagt 
aggaggtgaa 
aaactattag 
aggaattagt 
ggacaggatc 
cgttcagggg 
aggtccattc 
tctccacagt 
catctagcct 
tgattcttga 
ccctgaacgt 
tgatgacttg 
actgcttctc 
cacctacacc 
cccagaaaca 
gcgctcatgt 
tctctctggg 
cccgcctgag 
ttctcactgg 
ttccctggag 
aaaccctgct 
cagacaatag 
tcctgtccct 
aatgtacact 
ctttcttctg 
gtctttgagc 
ctgttgacct 
atcagatgcc 
aatcgtgtgc 
ggctgagggc 
aggaacacga 
gatctgatca 
agctcaggtg 
ctctgcagaa 
gagtggagga 
cttttgttag 



ccagctgtca 
gtgafcgttcc 
acgcttgtgc 
ttcttgcgat 
ttcggatatc 
ggagctgtgg 
agtgagtacc 
ggtctggcct 
accggagatg 
aaccagccag 
gctaaagatg 
gaaagccaac 
ctctcttcta 
tcccaggcaa 
tttctgtagc 
tgagctttga 
gttgattatc 
caaagcttct 
atgagggaag 
acttcaagcc 
tgcggtgttt 
aaagacattt 
ttttctttaa 
tttccagaat 
tacagttctt 
aataagatgg 
gtcgggtcaa 
agtatttccc 
tgggatgcca 
tataggcagc 
ttgagaaggt 
attaattagt 
gttgtgcccc 
aaagtataca 
ttcctggctc 
tttcttcttt 
ctgtccaggc 
ttttgaaagt 
cctgtgctcc 
tctctggccc 
tgagccagtg 
tcagaaccag 
tcagtgctct 
gaagaagcaa 
tcagggtgaa 
gaacacagtg 
tatagaagtc 
gtagctcagt 
accaaacact 
tgttaattgc 
tattcacctc 
gtgcacctac 
tgcttgataa 
catggatctg 
cccaatgtat 
ttttcaagat 
agagtccgga 
ccttacatct 
tccctttgca 
ttatcagtat 
ggctgctttt 



960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 

3900 

3960 

4020 

4080 

4140 

4200 

4260 

4320 

4380 

4440 

4500 

4560 
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cttaagtgcc cacatttgat ggagggtgga aataatttga atgtatttga tttataagtt 462 0 

tttttttttt tttgggttaa aagatggttg tagcatttaa aatggaaaat tttctccttg 4680 

gtttgctagt atcttgggtg tattctctgt aagtgtagct caaataggtc atcatgaaag 4740 

gttaaaaaag cgaggtggcc atgttatgct ggtggttgcc agggcctcca accactgtgc 4800 

cactgacttg ctgtgtgacc ctgggcaagt cacttaacta taaggtgcct cagttttcct 4860 

tctgttaaaa tggggataat aatactgacc tacctcaaag ggcagttttg aggcatgact 492 0 

aatgcttttt agaaagcatt ttgggatcct tcagcacagg aattctcaag acctgagtat 4980 

tttttataat aggaatgtcc accatgaact tgatacgtcc gtgtgtccca gatgctgtca 5040 

ttagtctata tggttctcca agaaactgaa tgaatccatt ggagaagcgg tggataacta 5100 

gccagacaaa atttgagaat acataaacaa cgcattgcca cggaaacata cagaggatgc 5160 

cttttctgtg attgggtggg attttttccc tttttatgtg ggatatagta gttacttgtg 522 0 

acaagaataa ttttggaata atttctatta atatcaactc tgaagctaat tgtactaatc 52 80 
tgagattgtg tttgttcata ataaaagtga agtgaatctg attgcactg 5329 

<210> 168 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_005063 
<400> 168 

aataatgcta ccaggatgct aaagatgatg atgttaaccc attccagtac agtattcttt 60 

<210> 169 
<211> 634 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> NM_005101 
<400> 169 

cggctgagag gcagcgaact catctttgcc agtacaggag cttgtgccgt ggcccacagc 6 0 

ccacagccca cagccatggg ctgggacctg acggtgaaga tgctggcggg caacgaattc 12 0 

caggtgtccc tgagcagctc catgtcggtg tcagagctga aggcgcagat cacccagaag 180 

attggcgtgc acgccttcca gcagcgtctg gctgtccacc cgagcggtgt ggcgctgcag 240 

gacagggtcc cccttgccag ccagggcctg ggccctggca gcacggfccct gctggtggtg 3 00 

gacaaatgcg acgaacctct gagcatcctg gtgaggaata acaagggccg cagcagcacc 3 60 

tacgaggtcc ggctgacgca gaccgtggcc cacctgaagc agcaagtgag cgggctggag 42 0 

ggtgtgcagg acgacctgtt ctggctgacc ttcgagggga agcccctgga ggaccagctc 480 

ccgctggggg agtacggcct caagcccctg agcaccgtgt tcatgaatct gcgcctgcgg 540 

ggaggcggca cagagcctgg cgggcggagc taagggcctc caccagcatc cgagcaggat 600 

caagggccgg aaataaaggc tgttgtaaga gaat 634 

<210> 170 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_005101 
<400> 170 

tggtggtgga caaatgcgac gaacctctga gcatcctggt gaggaataac aagggccgca 60 



<210> 171 

<211> 1339 

<212> DNA 

<213> Homo sapiens 
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<300> 

<308> NM_0 05139 
<400> 171 

gaattccgat tagtgtgatc tcagctcaag gcaaaggtgg gatatcatgg catctatctg 60 

ggttggacac cgaggaacag taagagatta tccagacttt agcccatcag tggatgctga 120 

agctattcag aaagcaatca gaggaattgg aactgatgag aaaatgctca tcagcattct 180 

gactgagagg tcaaatgcac agcggcagct gattgttaag gaatatcaag cagcatatgg 240 

aaaggagctg aaagatgact tgaagggtga tctctctggc cactttgagc atctcatggt 3 00 

ggccctagtg actccaccag cagtctttga tgcaaagcag ctaaagaaat ccatgaaggg 3 60 

cgcgggaaca aacgaagatg ccttgattga aatcttaact accaggacaa gcaggcaaat 42 0 

gaaggatatc tctcaagcct attatacagt atacaagaag agtcttggag atgacattag 480 

ttccgaaaca tctggtgact tccggaaagc tctgttgact ttggcagatg gcagaagaga 540 

tgaaagtctg aaagtggatg agcatctggc caaacaagat gcccagattc tctataaagc 600 

tggtgagaac agatggggca cggatgaaga caaattcact gagatcctgt gtttaaggag 660 

ctttcctcaa ttaaaactaa catttgatga atacagaaat atcagccaaa aggacattgt 72 0 

ggacagcata aaaggagaat tatctgggca ttttgaagac ttactgttgg ccatagttaa 7 80 

ttgtgtgagg aacacgccgg cctttttagc cgaaagactg catcgagcct tgaagggtat 840 

tggaactgat gagtttactc tgaaccgaat aatggtgtcc agatcagaaa ttgacctttt 900 

ggacattcga acagagttca agaagcatta tggctattcc ctatattcag caattaaatc 960 

ggatacttct ggagactatg aaatcacact cttaaaaatc tgtggtggag atgactgaac 102 0 

caagaagata atctccaaag gtccacgatg ggctttccca acagctccac cttacttctt 1080 

ctcatactat ttaagagaac aagcaaatat aaacagcaac ttgtgttcct aacaggaatt 114 0 

ttcattgttc tataacaaca acaacaaaag cgattattat tttagagcat ctcatttata 1200 

atgtagcagc tcataaatga aattgaaaat ggtattaaag atctgcaact actatccaac 12 6 0 

ttatatttct gctttcaaag ttaagaatct ttatagttct actccattaa atataaagca 132 0 
agataataaa acggaattc 1339 

<210> 172 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> 3KIM_005139 
<400> 172 

ttcagcaatt aaatcggata cttctggaga ctatgaaatc acactcttaa aaatctgtgg 6 0 

<210> 173 

<211> 1582 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_005165 
<400> 173 

ccgagctgtg cttgtggctg cggctgctaa ctggctgcgc acagggagct gtcaccatgc 60 

ctcactcgta cccagccctt tctgctgagc agaagaagga gttgtctgac attgccctgc 12 0 

ggattgtagc cccgggcaaa ggcattctgg ctgcggatga gtctgtaggc agcatggcca 180 

agcggctgag ccaaattggg gtggaaaaca cagaggagaa ccgccggctg taccgccagg 2 40 

tcctgttcag tgctgatgac cgtgtgaaaa agtgcattgg aggcgtcatt ttcttccatg 3 00 

agaccctcta ccagaaagat gahaatggtg ttcccttcgt ccgaaccatc caggataagg 3 60 

gcatcgtcgt gggcatcaag gttgacaagg gtgtggtgcc tctagctggg actgatggag 42 0 

aaaccaccac tcaagggctg gafcgggctct cagaacgctg tgcccaatac aagaaggatg 480 

gtgctgactt tgccaagtgg cgctgtgtgc tgaaaatcag tgagcgtaca ccctctgcac 540 

ttgccattct ggagaacgcc aacgtgctgg cccgttatgc cagtatctgc cagcagaatg 6 00 

gcattgtgcc tattgtggaa cctgaaatat tgcctgatgg agaccacgac ctcaaacgtt 660 

gtcagtatgt tacagagaag gtcttggctg ctgtgtacaa ggccctgagt gaccatcatg 72 0 
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tatacctgga 
tcaagtatac 
ccccagctgt 
tcaacctcaa 
atgggcgtgc 
gggctgccac 
agtatgaagg 
atgcctactg 
tgcttgtagt 
ccaacttgtc 
ggatgggagg 
caggatggct 
gcttctccct 
aagccctgac 
aaaaaaaaaa 



ggggaccctg 
cccagaggag 
cccaggagtg 
tgccatcaac 
cctgcaagcc 
tgaggagttc 
cagtggagaa 
agtatccact 
catggccagg 
ttcctttctc 
atagggagcc 
gggtctcccc 
gggctctcct 
tcatgccttg 
aaaaaaaaaa 



ctcaagccca 
attgccatgg 
accttcctgt 
cgctgccccc 
tctgcactca 
atcaagcggg 
gatggtggag 
ccataccaca 
gccaaatagc 
tcttcccttc 
cctcatgact 
ctacctcttc 
tcttgcctgc 
agtacatacc 
aa 1582 



acatggtgac 
caactgtcac 
ctgggggtca 
ttccccgacc 
atgcctggcg 
ctgaggtgaa 
cagcagcaca 
gcccttggcc 
tatgcagagc 
ccctctctca 
gagggcagaa 
cagctcccac 
cctgtctcct 
atacagcaaa 



cccgggccat 

tgccctgcgt 

gagcgaagaa 

ctgggcgctt 

agggcaacgg 

tgggcttgca 

gtcactctac 

cagccatctg 

agagatgcct 

ttgctgcacc 

gaaattgcta 

aattttccca 

gggatcagag 

taaatggtag 



gcctgtccca 
cgcactgtgc 
gaggcatcat 
accttctcct 
gacaatgctg 
gcccagggca 
attgccaacc 
cacccacttt 
tcacctggca 
tgggaccata 
gaagtcagaa 
tgatgaggta 
ggtagtacag 
caaaacaaaa 



780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 



<210> 174 

<211> 60 

<212> DNA 

<213> Homo sapiens 



<300> 

<308> NM_005165 



<400> 174 , cn 

gagggtagta cagaagccct gactcatgcc ttgagtacat accatacagc aaataaatgg 6 0 

<210> 175 
<211> 451 
<212> DNA 
<213> Homo sapiens 



<300> 

<308> NM_005213 



<400> 175 

acttccctgt tcactttggt tccagcatcc 
atacctggag gcttatctga ggccaaaccc 
aaggttaaac cacagcttga agaaaaaaca 
cagtataaaa ctcaagttgt tgctggaaca 
aataaatata tgcacttgaa agtattcaaa 
cttactggat accaggttga caaaaacaag 
gtacccaaag tgttctgatt ccttcaactg 
ataaccatca ataaagaagc attcttttcc 



tgtccagcaa agaagcaatc agccaaaatg 60 

gccactccag aaatccagga gattgttgat 12 0 

aatgagactt atggaaaatt ggaagctgtg 180 

aattactaca ttaaggtacg agcaggtgat 240 

agtcttcccg gacaaaatga ggacttggta 300 

gatgacgagc tgacgggctt ttagcagcat 360 

gctactgagt catgatcctt gctgataaat 420 
a 451 



<210> 176 

<211> 60 

<212> DNA 

<213> Homo sapiens 



<300> 

<308> NM_005213 



<400> 176 

aactggctac tgagtcatga tccttgctga taaatataac catcaataaa gaagcattct 



60 



<210> 177 
<211> 366 
<212> DNA 
<213> Homo sapiens 
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<300> 

<308> NML_005218 
<400> 177 

gtcagctcag cctccaaagg agccagcctc tccccagttc ctgaaatcct gagtgttgcc 60 
tgccagtcgc catgagaact tcctaccttc tgctgtttac tctctgctta cttttgtctg 12 0 
agatggcctc aggtggtaac tttctcacag gccttggcca cagatctgat cattacaatt 180 
gcgtcagcag tggagggcaa tgtctctatt ctgcctgccc gatctttacc aaaattcaag 240 
gcacctgtta cagagggaag gccaagtgct gcaagtgagc tgggagtgac cagaagaaat 300 
gacgcagaag tgaaatgaac tttttataag cattctttta ataaaggaaa attgcttttg 360 
aagtat 3 66 

<210> 178 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_005218 
<400> 178 

gggagtgacc agaagaaatg acgcagaagt gaaatgaact ttttataagc attcttttaa 6 0 

<210> 179 
<211> 1519 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> N]yL_005326 
<400> 179 

ctgcctcgga acgctgtccc ccgcagcgac ggcccgttcc acctcgcgat ctgccgggta 60 

cccgggcggc gtggcgctcg gcctccaggg atccactgtg cggtgccaaa aaagaggcgg 12 0 

aggctcgcgg cacagctctc ccggcgcagc tctcgggccg ccgccgccgc tcccaggccc 180 

gtctcccggc ccgtggcagt cggggctcgc ggacaaaaca agttgagcgc gagcgcgttg 240 

attggttggc ggacggtgcg aggtggacgc tgattggctg agggcagcgc gaggcgggcg 3 00 

ctgattggct gcgacgcgcc gacgccggtg ttttgcagtc ctgggcagct cggcagtcca 3 60 

gcccggcccg ggtcatggtg gtgggccgag ggctgctcgg ccgccgcagc ctcgccgcgc 42 0 

tgggagccgc ctgcgcccgc cgaggcctcg gtccagccct gctgggagtt ttctgccaca 480 

cagatttgcg gaagaacctg accgtggacg agggcaccat gaaggtagag gtgctgcctg 540 

ccctgaccga caactacatg tacctggtca ttgatgatga gaccaaggag gctgccattg 600 

tggatccggt gcagccccag aaggtcgtgg acgcggcgag aaagcacggg gtgaaactga 660 

ccacagtgct caccacccac caccactggg accatgctgg cgggaatgag aaactggtca 72 0 

agctggagtc gggactgaag gtgtacgggg gtgacgaccg tatcggggcc ctgactcaca 7 80 

agatcactca cctgtccaca ctgcaggtgg ggtctctgaa cgtcaagtgc ctggcgaccc 840 

cgtgccacac ttcaggacac atttgttact tcgtgagcaa gcccggaggc tcggagcccc 900 

ctgccgtgtt cacaggtgac accttgtttg tggctggctg cgggaagttc tatgaaggga 960 

ctgcggatga gatgtgtaaa gctctgctgg aggtcttggg ccggctcccc ccggacacaa 1020 

gagtctactg tggccacgag tacaccatca acaacctcaa gtttgcacgc cacgtggagc 1080 

ccggcaatgc cgccatccgg gagaagctgg cctgggccaa ggagaagtac agcatcgggg 1140 

agcccacagt gccatccacc ctggcagagg agtttaccta caaccccttc atgagagtga 1200 

gggagaagac ggtgcagcag cacgcaggtg agacggaccc ggtgaccacc atgcgggccg 126 0 

tgcgcaggga gaaggaccag ttcaagatgc cccgggactg aggccgccct gcaccttcag 132 0 

cggatttggg gattaggctc ttttaggtaa ctggctttcc tgctggtccg tgcgggaaat 1380 

tcagtcttga tttaacctta attttacagc ccttggcttg tgttatcgga cattctaatg 1440 

catatttata agagaagttt aacaagtatt tattcccata aaaaaaaaaa aaaaaaaaaa 1500 
aaaaaaaaaa aaaaaaaaa 1519 

<210> 180 
<211> 60 
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212> DNA 

213> Homo sapiens 
300> 

308> NM_005326 

cttgtgttat cggacattct aatgcatatt tataagagaa gtttaacaag tatttattcc 60 

<210> 181 

<211> 3378 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_005461 



<400> 181 
acagctgcac 
agagagcgag 
tgcctctcca 
tctggggcgg 
gcttggccgg 
ctcttccccc 
agcggcggct 
ccaccagccc 
agaaggagcc 
ccggctcggt 
gcttcagccc 
accagcagat 
tcggctcgca 
accaccacca 
acgacgagct 
cgccgtccag 
cgcacgcgac 
ccgacgacca 
ccaaggacga 
cccagtcttg 
agctcattca 
acgcctacaa 
ccagcgacag 
gcccttgccc 
tgtccctgcc 
gcgggccgcg 
tagcgcacgc 

gggaggtccc 

tcaccaaggc 
tgctttgctg 
ctctggcgtc 
tgtctgtcag 
cgctcggcgc 
aaatctgcat 
tgtaccattt 
tctttgggtg 
cccagctgta 
ctgagggtct 
aaaaaaagaa 
tggaagttgc 
gctttagttt 
cagggcaggg 
cttgtcttat 



cgccgagctg 
caacgggcac 
ccccgcccga 
tgcaggcagc 
cggccccggc 
ccgcgcttgg 
gcgcctcgct 
gctggccatg 
actggggcgc 
gtcctccaca 
gaccgaacag 
gaaccccgag 
cccagtgcca 
tcaccaccac 
gggcccgcac 
cgccgctagc 
ggcctcggcg 
gctcgtgtcc 
ggtgatccgc 
caggtataaa 
gcaggtggag 
ggtcaagtgc 
cccctcctct 
cggcccggac 
gcggccccag 
ggcgacgggc 
cagcgccgcc 
ggagtaactt 
catctggaga 
cccggagagt 
ccggcaggcg 
agttcggaga 
ccaaatctca 
taaaaatatt 
tcttgtttaa 
agaagggatc 
ggcttgtaaa 
ttgtttgggt 
aaagttccgg 
atgttttctt 
tcaaccttgt 
cagggaagct 
ggtcaaattg 



cgagcggctg 
tcgccccacg 
ctctacccgg 
ctcgggactc 
ccgctgcgcc 
ctcggcgcgc 
tcagcgatgg 
gagtatgtca 
gcggagcgtc 
ccgctcagca 
aagacacacc 
gcgctcaacc 
cagccgctgc 
cctcacccgc 
gctcacccgc 
ccggcgcaac 
acggcggcgg 
atgtccgtgc 
ctgaagcaga 
cgcgtccagc 
cagcttaagc 
gagaaactcg 
cccgagttct 
tccctgtccc 
ccttgacctg 
gggtgcgcgg 
tcctagactc 
ctctccaggc 
ctcctggctt 
agtccgcgcc 
agaggcgagg 
acactggctc 
tcagttttat 
aatcctgcat 
ctttagcatg 
gcagtttgga 
tacccgcccc 
ttattattac 
gcattttgca 
tccttccctt 
tggtgctgag 
gccaagctcc 
atacccttaa 



cgagcgagag 
cctcccctca 
cccggtccct 
tccggcgcgc 
ctcgctcccc 
tccggccggc 
ccgcggagct 
acgacttcga 
cgggcaggcc 
ctccgtgtag 
tcgaggatct 
tgacgcccga 
aaagcttcga 
accacgcgta 
accatcacca 
agctgcccac 
gcggcaacgg 
gcgagctgaa 
agcggcggac 
agaagcacca 
aggaggtgtc 
ccaactccgg 
ttctgtgagt 
acgtccctag 
tttgacttga 
gcgggcaggg 
gagcagagcc 
tgaagggcgg 
tctgaacttt 
aggaagagag 
ctgagcgaaa 
tcagccctga 
tgcctgctcg 
gctggacatg 
ttgttgatca 
aactccggcg 
gccaaaccgc 
ggtatttttg 
tcagaaaaca 
atccccattc 
agagagaacc 
gcaccccaga 
taagaaagga 



agcgtaagag 
gccccaccgc 
gcgcgggcac 
cgccgcgtcc 
gcctccccag 
cgcaaagttt 
gagcatgggg 
cctgctcaag 
ctgcacacgc 
ctccgtgccc 
gtactggatg 
ggacgcggtg 
cagctttcgc 
cccgggcgcc 
tcatcaccaa 
tagccacccc 
cagcgtggag 
ccgccacctg 
cctgaagaac 
cctggagaat 
ccggctggcc 
cttcagggag 
cgtggccggt 
tcccagacta 
gcgagaggga 
gaccttggct 
ggagagagag 
cgaggcatag 
gcgcgttaag 
caacgaggaa 
gaaggaagga 
gacacaggcc 
attatataga 
tatggtaata 
tggatcatac 
gctgcgtgcg 
atagagaacg 
tttgtaagtt 
actttgtctt 
ggtcctcttt 
gagaggtccc 
ggagtgttct 
aaggaaagga 



caagagagct 
gcgctccgct 
agcccagagc 
ccagacaaag 
ctcttctccg 
cccgggcggc 
ccagagctgc 
ttcgacgtga 
ctgcagccag 
tcgtcgccca 
gcgagcaact 
gaagcgctca 
ggcgctcacc 
ggcgtggccc 
gcgtcgccgc 
gggcccgggc 
gaccgcttct 
cggggcttca 
cggggctacg 
gagaagacgc 
cgcgagagag 
gcgggctcca 
cctggccccc 
ccccggaccc 
ggaagggcgc 
aaggcgagag 
acgagagggt 
tcccgagaag 
ccgggacagc 
aggagaggga 
cagacggacc 
tcagttagga 
aaaatacaaa 
atttctattt 
tccccttgtt 
gggtttcagt 
tggcagcaag 
aaaaagaaaa 
ggggcacact 
ttcctctctc 
agtacaaggg 
ggactacagc 
aaacagatcc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 
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tcccctctgc 
ctgcgctaat 
tggcagtgca 
gaaacccaaa 
ttgcgtttct 
tcaattcttc 
tttttttttt 
aaagtgtatt 
agtttgtatt 
gcagttttaa 
ttcaactgtg 
agggatcaat 
tttattcaga 
aaaaaaaaaa 



tttttattgt aaccagaatc accctgaggt cccttctgaa ccctctgggc 2 640 

tgtaggagcc acagcgctcc tagggtgaga ggcttagcca tccctgaccc 2700 

ctggtaagca gacactgcac tgaaccaact gctatgctca gaatgtacca 2760 

cattggcaag taattttgca actttcaagt gcgttcttta gaccaatgca 2 82 0 

ttccctgctt ttgagatagt aggaagagtt cttggtggtg tcccccccct 2 880 

agttgtatag tagttatagg gaagatatgg gtgtttttct ttattattac 2 940 

ctgcaggtca gtaaaaggat ttaagttgca ctgacaaaaa taccaaaata 3 000 

tttaagttcc catttgaaat tgctggcgct gctggccgga tgcatttttg 3 06 0 

agttgataaa ttaacagtaa taacaagatt gtatgaaccg catggtgctt 312 0 

atattgtgga tatttgtcct gcatcagaaa cgagctttgg tttttacaga 3180 

ttgaaatcaa acctgccgca acagaaattg tttttatttc atgtaaaata 3 240 

ttcaaaccct gcttatgata tgaaaatatt aaaacctagt ctattgtagt 33 0 0 

ctggtttctg ttttttggtt attaaaatgg tttcctattt tgcttattaa 33 60 
aaaaaaaa 3378 



<210> 182 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_005461 
<400> 182 

atttgtcctg catcagaaac gagctttggt ttttacagat tcaactgtgt tgaaatcaaa 60 

<210> 183 

<211> 597 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_005532 



<400> 183 

agctgaagtt 

gcctctgctc 

ggctctgccg 

gccatggcgg 

tcgtcctcca 

gcctcgggca 

accaagttca 

tagctccctg 

catcctgacc 

ctgcatctcc 



gaggatctct 
tcacctcatc 
tagttttgcc 
ctgtgcccat 
tagcagccaa 
gccttgtggg 
tcctgggctc 
cccctcgccc 
cagcgaggag 
agaggaaaat 



tactctctaa 
agcagtgacc 
cctggccagg 
ggtgctcagt 
gatgatgtcc 
tactctgcag 
cattgggtct 
tgcagagaag 
ccaactatcc 
aagaaataaa 



gccacggaat 
agtgtggcca 
attgctacag 
gccatgggct 
gcggcggcca 
tcactgggag 
gccattgcgg 
agaaccatgc 
caaatatacc 
gatgaattgt 



taacccgagc 
aagtggtcag 
ttgtgattgg 
tcactgcggc 
ttgccaatgg 
caactggact 
ctgtcattgc 
caggggagaa 
tgggtgaaat 
tgcaactctt 



aggcatggag 6 0 

ggtggcctct 12 0 

aggagttgtg 180 

gggaatcgcc 240 

gggtggagtt 3 00 

ctccggattg 3 60 

gaggttctac 42 0 

ggcacccagc 480 

ataccaaatt 540 
aaaaaaa 597 



<210> 184 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_005532 
<400> 184 

agccaactat cccaaatata cctgggtgaa atataccaaa ttctgcatct ccagaggaaa 60 



<210> 185 

<211> 1661 

<212> DNA 

<213> Homo sapiens 
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<300> 

<308> NM_005566 
<400> 185 

tgctgcagcc gctgccgccg attccggatc tcattgccac gcgcccccga cgaccgcccg 60 

acgtgcattc ccgattcctt ttggttccaa gtccaatatg gcaactctaa aggatcagct 12 0 

gatttataat cttctaaagg aagaacagac cccccagaat aagattacag ttgttggggt 180 

tggtgctgtt ggcatggcct gtgccatcag tatcttaatg aaggacttgg cagatgaact 240 

tgctcttgtt gatgtcatcg aagacaaatt gaagggagag atgatggatc tccaacatgg 3 00 

cagccttttc cttagaacac caaagattgt ctctggcaaa gactataatg taactgcaaa 3 60 

ctccaagctg gtcattatca cggctggggc acgtcagcaa gagggagaaa gccgtcttaa 42 0 

tttggtccag cgtaacgtga acatatttaa attcatcatt cctaatgttg taaaatacag 480 

cccgaactgc aagttgctta ttgtttcaaa tccagtggat atcttgacct acgtggcttg 540 

gaagataagt ggttttccca aaaaccgtgt tattggaagt ggttgcaatc tggattcagc 60 0 

ccgattccgt tacctgatgg gggaaaggct gggagttcac ccattaagct gtcatgggtg 660 

ggtccttggg gaacatggag attccagtgt gcctgtatgg agtggaatga atgttgctgg 72 0 

tgtctctctg aagactctgc acccagattt agggactgat aaagataagg aacagtggaa 7 80 

agaggttcac aagcaggtgg ttgagagtgc ttatgaggtg atcaaactca aaggctacac 840 

atcctgggct attggactct ctgtagcaga tttggcagag agtataatga agaatcttag 9 00 

gcgggtgcac ccagtttcca ccatgattaa gggtctttac ggaataaagg atgatgtctt 96 0 

ccttagtgtt ccttgcattt tgggacagaa tggaatctca gaccttgtga aggtgactct 102 0 

gacttctgag gaagaggccc gtttgaagaa gagtgcagat acactttggg ggatccaaaa 10 80 

ggagctgcaa ttttaaagtc ttctgatgtc atatcatttc actgtctagg ctacaacagg 1140 

attctaggtg gaggttgtgc atgttgtcct ttttatctga tctgtgatta aagcagtaat 12 00 

attttaagat ggactgggaa aaacatcaac tcctgaagtt agaaataaga atggtttgta 12 6 0 

aaatccacag ctatatcctg atgctggatg gtattaatct tgtgtagtct tcaactggtt 13 2 0 

agtgtgaaat agttctgcca cctctgacgc accactgcca atgctgtacg tactgcattt 13 80 

gccccttgag ccaggtggat gtttaccgtg tgttatataa cttcctggct ccttcactga 1440 

acatgcctag tccaacattt tttcccagtg agtcacatcc tgggatccag tgtataaatc 1500 

caatatcatg tcttgtgcat aattcttcca aaggatctta ttttgtgaac tatatcagta 1560 

gtgtacatta ccatataatg taaaaagatc tacatacaaa caatgcaacc aactatccaa 162 0 
gtgttatacc aactaaaacc cccaataaac cttgaacagt g 16 61 

<210> 186 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NMJD05566 
<400> 186 

catcaactcc tgaagttaga aataagaatg gtttgtaaaa tccacagcta tatcctgatg 60 

<210> 187 

<211> 2993 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_005689 
<400> 187 

gggcctgcag ttggcagaag ggtcccgggc ccagagccag cggggccgtg ctgagacggc 60 

gtacgtgccc tgcgtgagtg cgtggcggcg gcgcgtgcgc taggggagtg ggcggtgagg 12 0 

cctggtccac gtgcgtccct tcccgggacc cccgcagctt ggcgcccagc ggctacgtga 180 

gccaaggcac ccggatgtcc gcgcccctct ccgagtgaca agtcccggcc tccggtcccg 240 

cagtgcccgc agcctcggcc ggcgtccacg cattgccatg gtgactgtgg gcaactactg 3 00 

cgaggccgaa gggcccgtgg gtccggcctg gatgcaggat ggcctgagtc cctgcttctt 3 60 

cttcacgctc gtgccctcga cgcggatggc tctagggact ctggccttgg tgctggctct 420 
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icctgcaga cgccgggagc ggcccgctgg tgctgattcg ctgtcttggg gggccggccc 480 

igcatctct ccctacgtgc tgcagctgct tctggccaca cttcaggcgg cgctgcccct 540 

rccggcctg gctggccggg tgggcactgc ccggggggcc ccactgccaa gctatctact 600 

itggcctcc gtgctggaga gtctggccgg cgcctgtggc ctgtggctgc ttgtcgtgga 660 

iggagccag gcacggcagc gtctggcaat gggcatctgg atcaagttca ggcacagccc 720 

rgtctcctg ctcctctgga ctgtggcgtt tgcagctgag aacttggccc tggtgtcttg 780 

.acagccca cagtggtggt gggcaagggc agacttgggc caacaggttc agtttagcct 840 

;gggtgctg cggtatgtgg tctctggagg gctgtttgtc ctgggtctct gggcccctgg 900 

ittcgtccc cagtcctata cattgcaggt tcatgaagag gaccaagatg tggaaaggag 960 

:aggttcgg tcagcagccc aacagtctac ctggcgagat tttggcagga agctccgcct 102 0 

:tgagtggc tacctgtggc ctcgagggag tccagctctg cagctggtgg tgctcatctg 1080 

itggggctc atgggtttgg aacgggcact caatgtgttg gtgcctatat tctataggaa 1140 

Lttgtgaac ttgctgactg agaaggcacc ttggaactct ctggcctgga ctgttaccag 12 00 

:acgtcttc ctcaagttcc tccagggggg tggcactggc agtacaggct tcgtgagcaa 1260 

:tgcgcacc ttcctgtgga tccgggtgca gcagttcacg tctcggcggg tggagctgct 132 0 

ttcttctcc cacctgcacg agctctcact gcgctggcac ctggggcgcc gcacagggga 13 80 

rtgctgcgg atcgcggatc ggggcacatc cagtgtcaca gggctgctca gctacctggt 1440 

;tcaatgtc atccccacgc tggccgacat catcattggc atcatctact tcagcatgtt 15 00 

;tcaacgcc tggtttggcc tcattgtgtt cctgtgcatg agtctttacc tcaccctgac 15 60 

Lttgtggtc actgagtgga gaaccaagtt tcgtcgtgct atgaacacac aggagaacgc 162 0 

Lcccgggca cgagcagtgg actctctgct aaacttcgag acggtgaagt attacaacgc 1680 

fagagttac gaagtggaac gctatcgaga ggccatcatc aaatatcagg gtttggagtg 1740 

tagtcgagc gcttcactgg ttttactaaa tcagacccag aacctggtga ttgggctcgg 180 0 

:tcctcgcc ggctccctgc tttgcgcata ctttgtcact gagcagaagc tacaggttgg 1860 

ractatgtg ctctttggca cctacattat ccagctgtac atgcccctca attggtttgg 192 0 

icctactac aggatgatcc agaccaactt cattgacatg gagaacatgt ttgacttgct 19 80 

laagaggag acagaagtga aggaccttcc tggagcaggg ccccttcgct ttcagaaggg 2 040 

:gtattgag tttgagaacg tgcacttcag ctatgccgat gggcgggaga ctctgcagga 2100 

ftgtctttc actgtgatgc ctggacagac acttgccctg gtgggcccat ctggggcagg 2160 

lagagcaca attttgcgcc tgctgtttcg cttctacgac atcagctctg gctgcatccg 222 0 

itagatggg caggacattt cacaggtgac ccaggcctct ctccggtctc acattggagt 22 80 

[tgccccaa gacactgtcc tctttaatga caccatcgcc gacaatatcc gttacggccg 234 0 

[tcacagct gggaatgatg aggtggaggc tgctgctcag gctgcaggca tccatgatgc 240 0 

tttatggct ttccctgaag ggtacaggac acaggtgggc gagcggggac tgaagctgag 24 6 0 

jgcggggag aagcagcgcg tcgccattgc ccgcaccatc ctcaaggctc cgggcatcat 252 0 

:tgctggat gaggcaacgt cagcgctgga tacatctaat gagagggcca tccaggcttc 25 80 

itggccaaa gtctgtgcca accgcaccac catcgtagtg gcacacaggc tctcaactgt 2 640 

jtcaatgct gaccagatcc tcgtcatcaa ggatggctgc atcgtggaga ggggacgaca 27 0 0 

jaggctctg ttgtcccgag gtggggtgta tgctgacatg tggcagctgc agcagggaca 27 6 0 

jaagaaacc tctgaagaca ctaagcctca gaccatggaa cggtgacaaa agtttggcca 282 0 

itccctctc aaagactaac ccagaaggga ataagatgfcg tctcctttcc ctggcttatt 2880 

;atcctggt cttggggtat ggtgctagct atggtaaggg aaagggacct ttccgaaaaa 294 0 
itcttttgg ggaaataaaa atgtggactg tgaaaaaaaa aaaaaaaaaa aaa 2993 

>10> 188 
>11> 60 
>12> DNA 

113> Homo sapiens 
500> 

J08> NM_005689 
L00> 188 

jaaagggac ctttccgaaa aacatctttt ggggaaataa aaatgtggac tgtgaaaaaa 60 

J10> 189 
>11> 1830 
>12> DNA 

113> Homo sapiens 
J00> 
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<308> NM_005749 
<400> 189 

ggggagttga aacctaattt tgtggcgtag cagctatgca gcttgaaatc caagtagcac 6 0 

taaattttat tatttcgtat ttgtacaata agcttcccag gagacgtgtc aacatttttg 12 0 

gtgaagaact tgaaagactt cttaagaaga aatatgaagg gcactggtat cctgaaaagc 180 

catacaaagg atcggggttt agatgtatac acatagggga gaaagtggac ccagtgattg 2 40 

aacaagcatc caaagagagt ggtttggaca ttgatgatgt tcgtggcaat ctgccacagg 3 00 

atcttagtgt ttggatcgac ccatttgagg tttcttacca aattggtgaa aagggaccag 3 60 

tgaaggtgct ttacgtggat gataataatg aaaatggatg tgagttggat aaggagatca 42 0 

aaaacagctt taacccagag gcccaggttt ttatgcccat aagtgaccca gcctcatcag 480 

tgtccagctc tccatcgcct ccttttggtc actctgctgc tgtaagccct accttcatgc 540 

cccggtccac tcagccttta acctttacca ctgccacttt tgctgccacc aagttcggct 600 

ctaccaaaat gaagaatagt ggccgtagca acaaggttgc acgtacttct cccatcaacc 660 

tcggcttgaa tgtgaatgac ctcttgaagc agaaagccat ctcttcctca atgcactctc 72 0 

tgtatgggct tggcttgggt agccagcagc agccacagca acagcagcag ccagcccagc 7 80 

cgccaccgcc accaccacca ccacagcagc aacaacagca gaaaacctct gctctttctc 840 

ctaatgccaa ggaatttatt tttcctaata tgcagggtca aggtagtagt accaatggaa 9 00 

tgttcccagg tgacagcccc cttaacctca gtcctctcca gtacagtaat gcctttgatg 960 

tgtttgcagc ctatggaggc ctcaatgaga agtcttttgt agatggcttg aattttagct 102 0 

taaataacat gcagtattct aaccagcaat tccagcctgt tatggctaac taaaaaaaag 1080 

aaaatgtatc gtacaagtta aaatgcacgg gcccaagggg gatttttttt ttcacctcct 1140 

tgagaatttt tttttttaag cttatagtaa ggatacattc aagcttggtt aaaaaaataa 12 00 

taataaaaca tgcatcattt ttcatttgcc aaccaagcac aaagttattt tatactgact 12 60 

gtatatttta aagtatactc tcagatatgg cctcttacag tatttaagat atagcaagga 13 2 0 

catggctgat ttttttttat aaaaattggc actaataagt gggtttattg gtcttttcta 13 80 

attgtataat ttaatttagt acaaagtttg taaaatatca gaggatatat atatattgtt 1440 

tctacgacat ggtattgcat ttatatcttt ttactacagt gatctgtgac agcagcagct 1500 

tcatgttgta ttttttttac tgaaattgta aaatatccat cttaaagaca tcaactattc 1560 

taaaaattgt gtacaggata ttcctttagt ggtggaatta aaatgtacga atacttgctt 162 0 

tttcaaaaaa atgtattttc tgttaaaagt ttaaagattt ttgctatata ttatggaaga 1680 

aaaatgtaat cgtaaatatt aattttgtac ctatattgtg caatacttga aaaaaacggt 1740 

ataaaagtat tttgagtcag tgtcttacat gttaagaggg actgaaatag tttatattaa 180 0 
gtttgtatta aaattcttta aaattaaaaa 183 0 



<210> 190 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_005749 
<400> 190 

aaacctctgc tctttctcct aatgccaagg aatttatttt tcctaatatg cagggtcaag 60 

<210> 191 
<211> 1534 
<212> DNA 

<213> Homo sapiens 
<3 0 0> 

<308> NM_005804 



<400> 191 

ggaagcgcag caactcgtgt ctgagcgccc 
agcagcgcgc ggagaagaac ggggagccag 
tcttttggat tacgatgaag aggaagagcc 



ggcggaaaac cgaagttgga agtgtctctt 60 
catcatggca gaacaggatg tggaaaacga 12 0 
ccaggctcct caagagagca caccagctcc 180 
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ccctaagaaa gacatcaagg gatcctacgt ttccatccac agctctggct tccgggactt 240 

tctgctgaag ccggagctcc tgcgggccat cgtggactgt ggctttgagc atccttctga 300 

ggtccagcat gagtgcattc cccaggccat cctgggcatg gacgtcctgt gccaggccaa 360 

gtccgggatg ggcaagacag cggtcttcgt gctggccacc ctacagcaga ttgagcctgt 420 

caacggacag gtgacggtcc tggtcatgtg ccacacgagg gagctggcct tccagatcag 480 

caaggaatat gagcgctttt ccaagtacat gcccagcgtc aaggtgtctg tgttcttcgg 540 

tggtctctcc atcaagaagg atgaagaagt gttgaagaag aactgtcccc atgtcgtggt 600 

ggggaccccg ggccgcatcc tggcgctcgt gcggaatagg agcttcagcc taaagaatgt 660 

gaagcacttt gtgctggacg agtgtgacaa gatgctggag cagctggaca tgcggcggga 720 

tgtgcaggag atcttccgcc tgacaccaca cgagaagcag tgcatgatgt tcagcgccac 780 

cctgagcaag gacatccggc ctgtgtgcag gaagttcatg caggatccca tggaggtgtt 840 

tgtggacgac gagaccaagc tcacgctgca cggcctgcag cagtactacg tcaaactcaa 900 

agacagtgag aagaaccgca agctctttga tctcttggat gtgctggagt ttaaccaggt 960 

gataatcttc gtcaagtcag tgcagcgctg catggccctg gcccagctcc tcgtggagca 102 0 

gaacttcccg gccatcgcca tccaccgggg catggcccag gaggagcgcc tgtcacgcta 1080 

tcagcagttc aaggatttcc agcggcggat cctggtggcc accaatctgt ttggccgggg 1140 

gatggacatc gagcgagtca acatcgtctt taactacgac atgcctgagg actcggacac 12 0 0 

ctacctgcac cgggtggccc gggcgggtcg ctttggcacc aaaggcctag ccatcacttt 126 0 

tgtgtctgac gagaatgatg ccaaaatcct caatgacgtc caggaccggt ttgaagttaa 132 0 

tgtggcagaa cttccagagg aaatcgacat ctccacatac atcgagcaga gccggtaacc 13 80 

accacgtgcc agagccgccc acccggagcc gcccgcatgc agcttcacct cccctttcca 1440 

ggcgccactg ttgagaagct agagattgta tgagaataaa cttgttatta tggaaaaaaa 15 0 0 
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaa 1534 

<210> 192 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_005804 
<400> 192 

gttgagaagc tagagattgt atgagaataa agtgttatta tgaaatgaag aagcctcacc 60 

<210> 193 
<211> 1416 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> Modi f i ed_bas e 
<222> 1 . . . 1416 
<223> n = a,c,g, or t 

<300> 

<308> NM_005945 
<400> 193 

aggaattccg gaattccgga attccgatgg atggaacaga aaataaatct aagtttggtg 60 

cgaacgccat tctgggggtg tcccttgccg tctgcaaagc tggtgccgtt gagaaggggg 12 0 

tcccctgtac cgccacatcg cgtacttggc tggcaacttc gaagtcatcc tgccagtccc 180 

ggcgttcaag tgtcatcatc aatggcggtt ctcatgctgg caacaagctg gccatgcaga 240 

gtctgtcctc ccagtcggtg cagcaaactc agggaagcca tgccgcattg gagcagaggt 3 00 

ttaccacaac ctgaagaatg tcatcaagga gaaatatggg aaagatgcca ccaatgtggg 3 60 

gatttgcgcg ggtttgctcc caacatcctg gagaataaag aaggcctgga gctgctgaag 420 

actgctattg gaaagcctgg cctacactgt aaaggtggtc atggcatgga cgtagcggcc 480 

tccgagttct tcaggtcagg gaactatgac ctggacttca agtctcccga tgaccccagc 540 
aggtacatct cgcctgacca gctggctgac ctgtacaagt ccttcatcaa ggactaccca 600 
gtggtgtcta tcgaagatcc ctttgaccag gatgactggg gagcttcaga agttcacagc 660 
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cagtgcagga atccaggtag tggggggatg actcacagtg accaacccaa 
caaggcgtga acgagaagtc ctgcaactgc ctcctgctca aagtcaacca 
gtgaccgagt ctcttcaggc gtgcaagctg gcccaggcca atggttgggg 
tctcatcgtt cgggggagac tgaagatacc ttcatcgctg acctggttgt 
actggggcag atcaagactg gtgccccttg ccgatcacgc gcttggccaa 
ctcctcagaa ttgaagagga gctgggcagc aaggctaagt ttgccggcag 
aaccccttgg ccaagtaagc tgtgggcagg caagccttcg gtcacctgtt 
ccctcccctg gtgtcagctc aggcagctcg aggcccccga ccaacacttg 
tgctagttag cgcccaccgc cgtggagttc gtaccgcttc cttagaactc 
aagctccctg gaagccctgt tggcagctct agctttgcag ttgtgtaatt 
attgtttttc tcgccttact ttccaccaag tgtctagagt catgtgagcc 
tccggggtgg ccacaggcta gatccccggt ggttttgtgc tcaaaataaa 
gacccatgaa aaaaaaaaag gaattccgga attccg 1416 



agaggatcgc 
gattggctcc 
cgtcatggtg 

ggggctgtgc 

gtacaaccag 
gaacttcaga 
ggctacagac 
caggggtccc 
tacagaagcc 
ggcccaagtc 
tngtgtcatc 
aagcctcagt 



720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 



<210> 194 

<211> 60 

<212> DNA 

<213> Homo sapiens 



<300> 

<308> NM_005945 



<400> 194 

ttgtgtaatt ggcccaagtc attgtttttc tcgccttact ttccaccaag tgtctagagt 60 



<210> 195 
<211> 961 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> NM_006014 



<400> 195 

ggcgaccacg gtgtcttcaa aagccccgtc 
tgggtcagtt tgcaccagcg ctctggaatc 
aggaaaccgc agcctctcaa ccgctgaccg 
ggcgggaact gaccacgcgc cagtcaggct 
ggagtcgtgc gcagggggcg gggcttcggg 
cctgcgcttc gggggtggag tcggagcggc 
gacgcaggcg gaggcgctga cggcggggat 
gtggacacag ccgcagctcc ggccggtgga 
agagacgccg cgtctgcggc cagggggtca 
gtgcctttcc cgaccccctt ggaggcggaa 
gagccccacc aaagggtggt tgggaaggat 
cgctggaaag ctgaagactg tcgcctgctc 
ctttccctgg tggtgcggac catgcagcgc 
cctgggcaaa tggagcgagg tcccactttg 
ccctagggca ggaattccca cagttgctac 
ttcttaaatg tttgttacta cagaaaataa 
a 961 



agggttggct tcctggggcc ggaccgactg 60 

gagttacgcg cgaaagggca gagtttctgg 12 0 

ggtctcagaa ggcccccggc agggccgctt 180 

ctccagggac ctgcgcaggc gcgtgtgggc 240 

aaggagccac agagagggcg gggcgtagga 3 00 

gcggcggcgg tcatgcggga cgcggatgca 3 60 

ggccggggtg gccacagctg ccgcgggggc 42 0 

gctcccccag cgcacgcgcc aggtccgggc 480 

cgaatgcggc cgcacatatt caccctcagc 540 

atcgcccatg ggtccctggc accagatgcc 600 

ctcacagtga gtggcaggat cctggtcgtc 660 

cgaatttccg tcatcaactt tcttgaccag 720 

tttgggcccc ccgtttcccg ctaagcctgg 7 80 

cgtctccttg taggcagtgc gtccatcctt 840 

tttcctggga gggcctcatg ttttatctgg 900 

aactgcgcta ctaaaaaaaa aaaaaaaaaa 96 0 



<210> 196 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_006014 



<400> 196 
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ggcctcatgt tttatctggt tcttaaatgt ttgttactac agaaaataaa actgaggtat 



60 



<210> 197 

<211> 1648 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM__006086 



900 
960 
1020 



<400> 197 

atgcgggaga tcgtgcacat ccaggccggc cagtgcggca accagatcgg ggccaagttc 60 

tgggaagtca tcagtgatga gcatggcatc gaccccagcg gcaactacgt gggcgactcg 120 

gacttgcagc tggagcggat cagcgtctac tacaacgagg cctcttctca caagtacgtg 180 

cctcgagcca ttctggtgga cctggaaccc ggaaccatgg acagtgtccg ctcaggggcc 240 

tttggacatc tcttcaggcc tgacaatttc atctttggtc agagtggggc cggcaacaac 300 

tgggccaagg gtcactacac ggagggggcg gagctggtgg attcggtcct ggatgtggtg 360 

cggaaggagt gtgaaaactg cgactgcctg cagggcttcc agctgaccca ctcgctgggg 42 0 

ggggggacgg gctccggcat gggcacgttg ctcatcagca aggtgcgtga ggagtatccc 480 

gaccgcatca tgaacacctt cagcgtcgtg ccctcaccca aggtgtcaga cacggtggtg 540 

gaaccctaca acgccacgct gtccatccac cagctggtgg aaaacacgga tgaaacctac 600 

tgcatcgaca acgaggcgct ctacgacatc tgcttccgca ccctcaagct ggccacgccc 660 

acctacgggg acctcaacca cctggtatcg gccaccatga gcggagtcac cacctccttg 720 

cgcttcccgg gccagctcaa cgctgacctg cgcaagctgg ccgtcaacat ggtgcccttc 7 80 

ccgcgcctgc acttcttcat gcccggcttc gcccccctca ccaggcgggg cagccagcag 84 0 
taccgggccc tgaccgtgcc cgagctcacc cagcagatgt tcgatgccaa gaacatgatg 
gccgcctgcg acccgcgcca cggccgctac ctgacggtgg ccaccgtgtt ccggggccgc 
atgtccatga aggaggtgga cgagcagatg ctggccatcc agagcaagaa cagcagctac 

ttcgtggagt ggatccccaa caacgtgaag gtggccgtgt gtgacatccc gccccgcggc 1080 

ctcaagatgt cctccacctt catcgggaac agcacggcca tccaggagct gttcaagcgc 1140 

atctccgagc agttcacggc catgttccgg cgcaaggcct tcctgcactg gtacacgggc 12 00 

gagggcatgg acgagatgga gttcaccgag gccgagagca acatgaacga cctggtgtcc 1260 

gagtaccagc agtaccagga cgccacggcc gaggaagagg gcgagatgta cgaagacgac 1J2U 

gaggaggagt cggaggccca gggccccaag tgaaactgct cgcagctgga gtgagaggca 13 80 

ggtggcggcc ggggccgaag ccagcagtgt ctaaaccccc ggagccatct tgctgccgac 1440 

accctgcttt ccccatcgcc ctagggctcc cttgccgccc tcctgcagta tttatggcct 1500 

cgtcctcccc cacctaggcc acgtgtgagc tgctcctgtc tctgtcttat tgcagctcca 1560 

ggcctgacgt tttacggttt tgttttttac tggtttgtgt ttatattttc ggggatactt 162 0 
aataaatcta ttgctgtcag ataccctt 1648 

<210> 198 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_006086 



<400> 198 

tttttactgg tttgtgttta tattttcggg gatacttaat aaatctattg ctgtcagata 

<210> 199 

<211> 3074 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_006096 
<400> 199 

aacaaacctc gcctggctcc cagctggtgc tgaagctcgt cagttcacca tccgccctcg 



60 



60 
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gcttccgcgg ggcgctgggc cgccagcctc ggcaccgtcc tttcctttct ccctcgcgtt 
aggcaggtga cagcagggac atgtctcggg agatgcagga tgtagacctc gctgaggtga 
agcctttggt ggagaaaggg gagaccatca ccggcctcct gcaagagttt gatgtccagg 
agcaggacat cgagacttta catggctctg ttcacgtcac gctgtgtggg actcccaagg 



gaaaccggcc 

accccctctt 

tggacgcccc 

ccatggatca 

ttattggcat 

ctgagatggt 

actgggccgc 

acctttttgg 

acattgtgaa 

ggcgcgacct 

ctgctctgtt 

caaaattgga 

agatctccca 

acatgccctc 

tcacttctct 

cccacaccag 

tcacccccaa 

gctaggcggc 

cccggcccct 

ttattttgta 

ccaaaggcgg 

ttccattaac 

aaaaacaaga 

cctggttcaa 

ctgaggagaa 

taatagtgac 

attcatgtcg 

cgatttttaa 

ttctagcatc 

aaaaatattt 

cctctcctcc 

cagggagaag 

gctgccctgg 

tggaattggg 

tgtgatttct 

ttttcaatct 

gtcacgttct 

tggggagcgg 

ttccctctcc 

ctgacgtttc 

cattttggct 

aagtcagcca 

cagccttcca 

ccttccctgg 

ggaaactatt 

tccagaccaa 

aaaaaaaaaa 



tgtcatcctc 
caactacgag 
tggccagcag 
gctggctgaa 
gggaacagga 

ggagggcctt 

ctccaagatc 

gaaggaagaa 

tgacatgaac 

ggagattgag 

ggtggttggg 

cccaacaaag 

gccggccaag 

ggctagcatg 

ggatggcacc 

cgagggcacc 

ctcgggtgct 

ctgcccagct 

tttcgccccc 

agagtgagct 

ggtggcggtg 

cagtggccgg 

cgcgtagcag 

acgggtgcct 

gctgatccag 

atgcaggcac 

aggctagagg 

agttggtgca 

ctcttaatgt 

cagcaaacgt 

acagccccaa 

atcatttaga 

ctgtgatgag 

gttactcgat 

gtggatccca 

tcgacagctg 

tggatcctca 

tggctgagaa 

tgcttcctct 

tgggcggcca 

caccgtggat 

ctgggacccg 

ggtggcgtga 

ccttgagtag 

gtgcacaagt 

taaatttgta 

aaaa 3 074 



acctaccatg 

gacatgcagg 

gacggcgcag 

atgcttcctg 

gcaggcgcct 

gtccttatca 

tcaggatgga 

atgcagagta 

cccggcaacc 

cgaccaatgc 

gacagctcgc 

accactctcc 

ctcgctgagg 

acccgcctga 

cgcagccgct 

cgcagccgct 

gctgggaaca 

gccgcccccg 

tgcctgccat 

ctggtggaga 

gaccaaagga 

ttgccactct 

cacacacttc 

ggtcagaagg 

tttccggaaa 

ctcttttaaa 

catttggaac 

tctagaaagc 

gcagcaaaag 

tgggcatcat 

cctcccaccc 

tttgttttgc 

tgtccttgca 

gtaagggatt 

gcttggttcc 

ggctggaacg 

gaactctttg 

aatgtaagga 

tttccfcgctc 

gtgcggctgc 

tttctcatag 

aggatttctg 

gaggcaatga 

aaaagtcggg 

ctttccagag 

actttgcagc 



acatcggcat 

agatcaccca 

cctccttccc 

gagtccttca 

acatcctaac 

acgtgaaccc 

cccaagctct 

acgtggaagt 

tgcacctgtt 

cgggaaccca 

ctgcagtgga 

tcaagatggc 

ccttcaagta 

tgcggtcccg 

cccacaccag 

cgcacaccag 

gcgccgggcc 

gactctgatc 

actgcgccta 

caaatgaggt 

aggaagcaag 

cctcccctcc 

acaaagccaa 

ccagccgccc 

caaaatcctt 

caggcaaaac 

aacaaatcta 

tttgaatgca 

caggcgacaa 

ggtttttgaa 

ctgatacatg 

attccttaga 

ggggccggag 

ccttgttgtt 
aggaattttg 
tgaactcagt 
ctcttgtcgg 
ttctggaata 
cctaaccttt 
caggttcctg 
gaagtttggt 
ggaccccgca 
ctcgttacct 
gatcggggca 
gagtttctta 
ggaaaaaaaa 



gaaccacaaa 

gcactttgcc 

cgcagggtac 

acagtttggg 

tcgatttgct 

ttgtgcggaa 

gccggacatg 

ggtccacacc 

catcaatgcc 

cacagtcacc 

tgccgtggtg 

ggactgtggc 

cttcgtgcag 

cacagcctct 

cgagggcacc 

cgagggggcc 

caagtccatg 

tctgtagtgg 

actcggtatt 

ctattacgtg 

catctccgca 

ctcagagaca 

gcctaggccg 

acttcccgtt 

ttctcatttg 

aggaaggggg 

cgtagttaac 

gaagcaaaca 

aatctcctgg 

ggctttagtt 

agccagtgat 

atggagggca 

taggagcact 

gtgttgagat 

tgtgatbggc 

agctgaacct 

ggtgggggtg 

catattccat 

cgccgaatgg 

tactactgcc 

cagagtgaat 

gttgggagga 

gccgcccatc 

agagaggctg 

atgagatatt 

aaaaaaaaaa 



acctgctaca 

gtctgccacg 

atgtacccct 

ctgaaaagca 

ctaaacaacc 

ggctggatgg 

gtggtgtccc 

taccgccagc 

tacaacagcc 

ctgcagtgcc 

gagtgcaact 

ggcctcccgc 

ggcatgggat 

ggttccagcg 

cgaagccgct 

cacctggaca 

gaggtctcct 

ccccctcctc 

aatccaaagc 

ggtgccctct 

tcgcatcctc 

ccaaactgcc 

ccctgagcat 

tcctctttaa 

gggagggggg 

aaaaggtggg 

ttgaagaaac 

agcttgattt 

ctttacagac 

ctgctttctg 

tattcttgtt 

acattccaca 

ggggtggggg 

ccagtgcagt 

ttaaatccag 

gtctgacccg 

ggaactcacg 

gggactttcc 

ggcagcacca 

ttgtactttt 

tgaatattgt 

ggaagtagtc 

accttggagg 

agtacggatg 

tgtatttatt 

aaaaaaaaaa 



120 
180 
240 
300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 



<210> 200 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_006096 
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<400> 200 

gagtacggat gggaaactat tgtgcacaag tctttccaga ggagtttctt aatgagatat 6 0 

<210> 201 

<211> 2148 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> nm_006115 



<400> 201 
gcttcagggt 
cgggacaccc 
actctctgag 
gagacctaga 
acgaaggcgt 
cccacggaga 
tgccgccctg 
cgggagacac 
tctgggagtg 
tggacttgat 
ggatttacgg 
tctgtactca 
tggtttgagc 
cctcaaggaa 
gaaaaatgta 
tatcaagatg 
tacctggaag 
gcgtagactc 
gcagtatatc 
tgtggactct 
ccccttggaa 
gtcccagagt 
cgatgtaagt 
cctggtcttt 
gagccactgc 
cttgcagagt 
tgtccccctg 
tctgcatgcc 
tagtgccaac 
gtgcccctgt 
ttggacacta 
acaaatgttc 
gttcagtgag 
gtgatctttg 
gattctggct 
tgttgaaaat 



acagctcccc 
cacccgcttc 
gaaaaaccat 
aatccaagcg 
ttgtggggtt 
cttgtggagc 
gagttgctgc 
agccagaccc 
ctgatgaagg 
gtgctccttg 
aagaactctc 
tttccagagc 
acagaggcag 
ggtgcctgtg 
ctacgcctgt 
atcctgaaaa 
ctacccacct 
ctcctctccc 
gcccagttca 
ttatttttcc 
accctctcaa 
cccagcgtca 
cccgagcccc 
gatgagtgtg 
tcccagctta 
ctcctgcagc 
gagagttatg 
aggctcaggg 
ccctgtcctc 
ttcatgccta 
aagccaggat 
agtgtgagtg 
gaaaaaaagg 
gggagataca 
tgggaagtac 
aaagagaagc 



cgcagccaga agccgggcct gcagcgcctc agcaccgctc 60 

ccaggcgtga cctgtcaaca gcaacttcgc ggtgtggtga 12 0 

tttgattatt actctcagac gtgcgtggca acaagtgact 180 

ttggaggtcc tgaggccagc ctaagtcgct tcaaaatgga 240 

ccattcagag ccgatacatc agcatgagtg tgtggacaag 3 00 

tggcagggca gagcctgctg aaggatgagg ccctggccat 3 60 

ccagggagct cttcccgcca ctcttcatgg cagcctttga 42 0 

tgaaggcaat ggtgcaggcc tggcccttca cctgcctccc 480 

gacaacatct tcacctggag accttcaaag ctgtgcttga 5 40 

cccaggaggt tcgccccagg aggtggaaac ttcaagtgct 600 

atcaggactt ctggactgta tggtctggaa acagggccag 660 

cagaagcagc tcagcccatg acaaagaagc gaaaagtaga 72 0 

agcagccctt cattccagta gaggtgctcg tagacctgtt 7 80 

atgaattgtt ctcctacctc attgagaaag tgaagcgaaa 840 

gctgtaagaa gctgaagatt tttgcaatgc ccatgcagga 9 00 

tggtgcagct ggactctatt gaagatttgg aagtgacttg 9 60 

tggcgaaatt ttctccttac ctgggccaga tgattaatct 1020 

acatccatgc atcttcctac atttccccgg agaaggaaga 1080 

cctctcagtt cctcagtctg cagtgcctgc aggctctcta 1140 

ttagaggccg cctggatcag ttgctcaggc acgtgatgaa 1200 

taactaactg ccggctttcg gaaggggatg tgatgcatct 12 60 

gtcagctaag tgtcctgagt ctaagtgggg tcatgctgac 132 0 

tccaagctct gctggagaga gcctctgcca ccctccagga 13 80 

ggatcacgga tgatcagctc cttgccctcc tgccttccct 144 0 
caaccttaag cttctacggg aattccatct ccatatctgc 1500 
acctcatcgg gctgagcaat ctgacccacg tgctgtat cc 156 0 
aggacatcca tggtaccctc cacctggaga ggcttgccta 162 0 
agttgctgtg tgagttgggg cggcccagca tggtctggct 16 80 
actgtgggga cagaaccttc tatgacccgg agcccatcct 1740 
actagctggg tgcacatatc aaatgcttca ttctgcatac 1800 
gtgcatgcat cttgaagcaa caaagcagcc acagtttcag 1860 
aggaaaacat gttcagtgag gaaaaaacat tcagacaaat 192 0 
ggaagttggg gataggcaga tgttgacttg aggagttaat 1980 
tcttatagag ttagaaatag aatctgaatt tctaaaggga 2 040 
atgtaggagt taatccctgt gtagactgtt gtaaagaaac 2100 
aatgtgaagc aaaaaaaaaa aaaaaaaa 2148 



<210> 202 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NMJD06115 



<400> 202 

tggggagata catcttatag agttagaaat agaatctgaa tttctaaagg gagattctgg 60 
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<210> 203 
<211> 1051 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> NM_006332 



<400> 203 

ggaccgccgc 

tgcccctgct 

gctggacgtc 

gaatgggcca 

gtccaatgca 

agccttcctg 

cacgctggtg 

gtgccagcat 

acttgacatg 

gagaagtctg 

ggagtgtgca 

tgctctccag 

agatcagacc 

ctgcccttcc 

cggagagctc 

cctcggcacc 

caaaattcca 

agtttaataa 



ctggttaaag 
gcgatgaccc 
cccacggcgg 
ccagttaact 
ccgcttgtca 
atccgggagc 
ccctacggaa 
ggagaagagg 
gagctagcct 
ccactatgcc 
atgggggacc 
ccaccacacg 
cagctcctta 
tcaaccagct 
atggaaggcg 
tgctacttac 
ccccatgatc 
gcaaaaaaaa 



gcgcttattt 
tgtcgccact 
cggtgcaggc 
acaagacagg 
atgtgaccct 
tcttcccaac 
acgcacagga 
agtgcaaatt 
tcctgaccat 
tgcagctcta 
gcggcatgca 
agtatgtgcc 
cccttgtctg 
ccctcaggag 
agtgggaacc 
caactggaaa 
aagaatcctg 
aaaaaaaaaa 



cccaggcagc 
tctgctgttc 
gtcccctctg 
caatctatac 
ctactatgaa 
atggctgttg 
acaaaatgtc 
caacaaggtg 
tgtctgcatg 
cgccccaggg 
gctcatgcac 
ctgggtcacc 
ccagttgtac 
tgtttgcttc 
cggctgcctg 
attttatgca 
ctccactaag 
a 1051 



cgctgcagtc 
ctgccacc.gc 
caagcgttag 
ctgcgggggc 
gcactgtgcg 
gtcatggaga 
agtggcaggt 
gaggcctgcg 
gaagagtttg 
ctgtcgccag 
gccaacgccc 
gtcaatggga 
cagggcaaga 
aagtgatggc 
cctttttttc 
tcccatgaag 
aatggtgcta 



gccacacctt 
tgctgctgct 
acttctttgg 
ccctgaagaa 
gtggctgccg 
tcctcaatgt 
gggagttcaa 
tgttggatga 
aggacatgga 
acactatcat 
agcggacaga 
aacccttgga 
agccggatgt 
cggtgagctg 
tgatccagac 
cccagataca 
aagtaaaact 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 



<210> 204 

<211> 60 

<212> DNA 

<213> Homo sapiens 



<300> 

<308> NM_006332 



<400> 204 

aaattccacc cctagatcaa gaatcctgct ccactaagaa tggtgctaaa gtaaaactag 60. 
<210> 205 



<211> 1714 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_006417 



<400> 205 

ggggcatttt gtgcctgcct agctatccag 
actacagaca gtacaacaga tcaagaagta 
acgaaaagat cctgcaaaat cattttggag 
gtgtccatgg attccgtaat ggagttfctgc 
taacagtgat ttatagtgaa gatcatatta 
aaggaaagta tgcttccatc atcctttttg 
aactaggact atgtacacca gaaacactgt 
caactaattt ccagatagat ggaagaaata 
aaaatcttgg acttgctcaa aattgtacta 
gcgaagattc actggatgaa agaaagataa 
tgtctgcctt gagaacttat gaaccatatg 
tgctgggtcc aattggagct gggaagtcca 



acagagcagc taccctcagc tctagctgat 60 

tggcagtgac aactcgtttg acacggttgc 12 0 

ggaagcggct tagccttctc tataagggta 180 

ttgacagatg ttgtaatcaa gggcctactc 240 

ttggagcata tgcggaagag agttaccagg 3 00 

cacttcaaga tactaaaatt tcagaatgga 3 60 

tttgttgtga tgttacaaaa tataactccc 42 0 

gaaaagtgat tatggactta aagacaatgg 480 

tctctattca ggattatgaa gtttttcgat 540 

aaggggtcat tgagctcagg aagagcttac 600 

gatccctggt tcaacaaata cgaattctgc 660 

gctttttcaa ctcagtgagg tctgttttcc 72 0 
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aagggcatgt aacgcatcag gctttggtgg gcactaatac aactgggata tctgagaagt 780 

ataggacata ctctattaga gacgggaaag atggcaaata cc tgccgttt attctgtgtg 840 

actcactggg gctgagtgag aaagaaggcg gcctgtgcag ggatgacata ttctatatct 900 

tgaacggtaa cattcgtgat agataccagt ttaatcccat ggaatcaatc aaattaaatc 960 

atcatgacta cattgattcc ccatcgctga aggacagaat tcattgtgtg gcatttgtat 102 0 

ttgatgccag ctctattcaa tacttctcct ctcagatgat agtaaagatc aaaagaattc 1080 

gaagggagtt ggtaaacgct ggtgtggtac atgtggcttt gctcactcat gtggatagca 1140 

tggatttgat tacaaaaggt gaccttatag aaatagagag atgtgagcct gtgaggtcca 12 00 

agctagagga agtccaaaga aaacttggat ttgctctttc tgacatctcg gtggttagca 12 60 

attattcctc tgagtgggag ctggaccctg taaaggatgt tctaattctt tctgctctga 132 0 

gacgaatgct atgggctgca gatgacttct tagaggattt gccttttgag caaataggga 1380 

atctaaggga ggaaattatc aactgtgcac aaggaaaaaa atagatatgt gaaaggttca 1440 

cgtaaatttc ctcacatcac agaagattaa aattcagaaa ggagaaaaca cagaccaaag 1500 

agaagtatct aagaccaaag ggatgfcgttt tattaatgtc taggatgaag aaatgcatag 1560 

aacattgtag tacttgtaaa taactagaaa taacatgatt tagtcataat tgtgaaaaat 1620 

agtaataatt tttcttggat ttatgttctg tatctgtgaa aaaataaatt tcttataaaa 1680 
ctcggaaaaa aaaaaaaaaa aaaaaaaaaa aaaa 1714 

<210> 206 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_006417 



<400> 206 

atgacatatt ctatatcttg aacggtaaca ttcgtgatag ataccagttt aatcccatgg 60 

<210> 207 

<211> 3791 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_006461 



<400> 207 

acagacggcg ggtgaacatg gcgtcctcga cttggtctga gacgtgatag gcctgccttc 60 

tggttgaaga tgtggcgagt gaaaaaactg agcctcagcc tgtcgccttc gccccagacg 12 0 

ggaaaaccat ctatgagaac tcctctccgt gaacttaccc tgcagcccgg tgccctcacc 180 

acctctggaa aaagatcccc cgcttgctcc tcgctgaccc catcactgtg caagctgggg 240 

ctgcaggaag gcagcaacaa ctcgtctcca gtggattttg taaataacaa gaggacagac 3 00 

ttatcttcag aacatttcag tcattcctca aagtggctag aaacttgtca gcatgaatca 360 

gatgagcagc ctctagatcc aattccccaa attagctcta ctcctaaaac gtctgaggaa 420 

gcagtagacc cactgggcaa ttatatggtt aaaaccatcg tccttgtacc atctccactg 480 

gggcagcaac aagacatgat atttgaggcc cgtttagata ccatggcaga gacaaacagc 540 

atatctttaa atggaccttt gagaacagac gatctggtga gagaggaggt ggcaccctgc 600 

atgggagaca ggttttcaga agttgctgct gtatctgaga aacctatctt tcaggaatct 660 

ccgtcccatc tcttagagga gtctccacca aatccctgtt ctgaacaact acattgctcc 720 

aaggaaagcc tgagcagtag aactgaggct gtgcgtgagg acttagtacc ttctgaaagt 7 80 

aacgccttct tgccttcctc tgttctctgg ctttcccctt caactgcctt ggcagcagat 840 

ttccgtgtca atcatgtgga cccagaggag gaaattgtag agcatggagc tatggaggaa 900 

agagaaatga ggtttcccac acatcctaag gagtctgaaa cagaagatca agcacttgtc 960 

tcaagtgtgg aagatattct gtccacatgc ctgacaccaa atctagtaga aatggaatcc 1020 

caagaagctc caggcccagc agtagaagat gttggtagga ttcttggctc tgatacagag 1080 

tcttggatgt ccccactggc ctggctggaa aaaggtgtaa atacctccgt catgctggaa 1140 

aatctccgcc aaagcttatc ccttccctcg atgcttcggg atgctgcaat tggcactacc 12 00 

cctttctcta cttgctcggt ggggacttgg tttactcctt cagcaccaca ggaaaagagt 12 60 

acaaacacat cccagacagg cctggttggc accaagcaca gtacttctga gacagagcag 132 0 

ctcctgtgtg gccggcctcc agatctgact gccttgtctc gacatgactt ggaagataac 13 80 
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ctgctgagct 
cagctggctg 
agtgggataa 
cagcaggcca 
cttcacctat 
cgtgcagaaa 
cagagcctca 
aaggatgcgg 
cagctggaac 
acccaactgg 
acttctacct 
ttgctgagtc 
caggaacgtg 
tctgcccagt 
ctagcaacag 
gagctacaga 
tgccagctta 
ctaaaacaca 
gacctgaaag 
ggtcaggttg 
tgtgagaacc 
gataaccagg 
actgagcaac 
caagagaccc 
gacaggacct 
cctgtgccct 
cttcagcccg 
actgagcttc 
ctgcagcgaa 
gaagtccaga 
tacaagaatg 
atagacaaga 
cttcggcgtg 
ccaactgcca 
tggacaaact 
gttccagagc 
aaaaactaga 
tgaggggatg 
agaatttgtt 
ggcatagagc 
aaaaaaaaaa 



ctcttgtcat 
tccctcaccc 
ctaataaact 
gaaatgtcat 
ccctgttgca 
cafctggtctg 
aagcagaaag 
cagagatagt 
aggacctagc 
tagggcttca 
tgcaacaaga 
ggtcccgaca 
atgtggcaat 
tagaggagtg 
atctccgggc 
gtcagcatac 
cccagagcaa 
tgcaggcaga 
agaccttgga 
agtgtcaatt 
tcaaggacac 
agcaagatct 
tacagagcct 
ttctgctgag 
tcctgggaag 
tgcttggaag 
cagagacccc 
agagtctttg 
aaatttgtga 
aggcaaaaga 
aaaaggagct 
gtggcgagct 
cggagacaga 
gcctatggcc 
gagagtgatg 
ccatagaaat 
tgacattgtt 
caaagaacta 
tcacctcttt 
caactgagat 
a 3791 



tgtggagttt 
agaaacccag 
tcagcatctt 
gcaatcatgg 
tttagaagaa 
ttgctgtttt 
ggaggaggca 
gttggaggct 
atccatgcgg 
tgccaagcaa 
ctggaggtcc 
actcacagag 
tgaggaaaag 
caaaggccaa 
tcagttgcag 
ccattgtgcc 
tgaggagcag 
actgcagcag 
gtttgcagac 
gaaaaccaca 
tgtagagaac 
ggagaaaaca 
gactctcttt 
tacagcctgt 
catcttgaca 
tgacaagagt 
aggcatggag 
ttccctgcta 
gctgcaagct 
agcagacata 
ccaggaagtg 
cataagcctt 
gaccaaagtg 
accaattgga 
ttcctggaga 
atcctagagg 
cagcatattt 
cagggattgc 
ttacctgcaa 
aaatgctatt 



ctctcccgcc 
gacagtagca 
aaggagagcc 
gtgcttatct 
gataagacta 
gatttgctga 
aggcacagag 
ttctgtgcac 
gaattcagag 
gaagagctgg 
atgcaactgg 
aaactcacag 
caggaggttt 
acagaacaac 
attctggcca 
caggacctgg 
gctgctcaat 
caacaagctg 
caggagaatc 
ctggaagtgc 
ctaacggcta 
cggcagtact 
ctacagacaa 
cctcccaccc 
gcagtggcag 
gctttcaccc 
gagagcctgg 
caagagtcta 
aggctgcagg 
gagaagctga 
atacagcaga 
agagaggagg 
ctccaggagg 
tccaggagaa 
tgaaaaatga 
agaaccttcg 
ataagaccct 
tggaatttct 
taccccctta 
taaataaagt 



agcttcggga 
cacagactga 
atgagatggg 
ctaaagagct 
ctgtgaatca 
agaaattgag 
aggaaatggc 
acgccagcca 
gccttctgaa 
ttcagcagac 
attatacaac 
tcaagagcca 
ctagggtgct 
tggagttgga 
acatggacag 
ctatgaagga 
gcgtaaagga 
tcctggccaa 
aggttgctca 
tccgggagcg 
aactggccag 
ctcaaaagct 
aactaaagga 
aggaacaccc 
atgaagagcc 
gagtagcatc 
cagaaatgag 
aagaagaagc 
cccaggaaga 
accaggcctt 
atgagaagat 
tgacccacct 
cctggcaggc 
agtgtggctc 
gaaggaaaac 
gcgctctgac 
gctctctatt 
gagctaagaa 
ccccaatacc 
gtatttaatg 



ctggaagagc 
cacatctcac 
acaggcccta 
gatatccttg 
ggagtctcgg 
ggcaaagctc 
tctcagaggc 
gcgcatcagc 
ggatgcccag 
agtgagtctt 
atggacagct 
gcaagccctg 
ggaacaagtc 
aaacattcgt 
ccagctaaaa 
tgagttactc 
agagatggca 
agaggtgcgg 
cctggagctg 
cagcttgcag 
caccatagca 
agggctgctg 
gaagactgaa 
tctgcctaat 
agaatcaact 
aatggtttcc 
tattatgact 
catcaggact 
acagcatcag 
gtgcttgcgc 
cctagaacag 
tacccgctca 
cagctggact 
tctcaggagg 
tcctgatcaa 
aaggagttag 
ccagaggtgg 
actgaaagcc 
aagaccaact 
aaaaaaaaaa 



1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 



<210> 208 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_006461 



<400> 208 

ctgacaagga gttagaaaaa ctagatgaca ttgttcagca tatttataag accctgctct 

<210> 209 

<211> 2856 

<212> DNA 

<213> Homo sapiens 



60 



<300> 

<308> NM_006516 
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60 



900 
960 
1020 



<400> 209 

tagtcgcggg tccccgagtg agcacgccag ggagcaggag accaaacgac gggggtcgga 

gtcagagtcg cagtgggagt ccccggaccg gagcacgagc ctgagcggga gagcgccgct 12 0 

cgcacgcccg tcgccacccg cgtacccggc gcagccagag ccaccagcgc agcgctgcca 180 

tggagcccag cagcaagaag ctgacgggtc gcctcatgct ggctgtggga ggagcagtgc 240 

ttggctccct gcagtttggc tacaacactg gagtcatcaa tgccccccag aaggtgatcg 300 

aggagttcta caaccagaca tgggtccacc gctatgggga gagcatcctg cccaccacgc 360 

tcaccacgct ctggtccctc tcagtggcca tcttttctgt tgggggcatg attggctcct 42 0 

tctctgtggg ccttttcgtt aaccgctttg gccggcggaa ttcaatgctg atgatgaacc 480 

tgctggcctt cgtgtccgcc gtgctcatgg gcttctcgaa actgggcaag tcctttgaga 540 

tgctgatcct gggccgcttc atcatcggtg tgtactgcgg cctgaccaca ggcttcgtgc 600 

ccatgtatgt gggtgaagtg tcacccacag cctttcgtgg ggccctgggc accctgcacc 660 

agctgggcat cgtcgtcggc atcctcatcg cccaggtgtt cggcctggac tccatcatgg 72 0 

gcaacaagga cctgtggccc ctgctgctga gcatcatctt catcccggcc ctgctgcagt 7 80 

gcatcgtgct gcccttctgc cccgagagtc cccgcttcct gctcatcaac cgcaacgagg 840 
agaaccgggc caagagtgtg ctaaagaagc tgcgcgggac agctgacgtg acccatgacc 
tgcaggagat gaaggaagag agtcggcaga tgatgcggga gaagaaggtc accatcctgg 
agctgttccg ctcccccgcc taccgccagc ccatcctcat cgctgtggtg ctgcagctgt 

cccagcagct gtctggcatc aacgctgtct tctattactc cacgagcatc ttcgagaagg 1080 

cgggggtgca gcagcctgtg tatgccacca ttggctccgg tatcgtcaac acggccttca 1140 

ctgtcgtgtc gctgtttgtg gtggagcgag caggccggcg gaccctgcac ctcataggcc 120 0 

tcgctggcat ggcgggttgt gccatactca tgaccatcgc gctagcactg ctggagcagc 12 6 0 

taccctggat gtcctatctg agcatcgtgg ccatctttgg ctttgtggcc ttctttgaag 132 0 

tgggtcctgg ccccatccca tggttcatcg tggctgaact cttcagccag ggtccacgtc 1380 

cagctgccat tgccgttgca ggcttctcca actggacctc aaatttcatt gtgggcatgt 144 0 

gcttccagta tgtggagcaa ctgtgtggtc cctacgtctt catcatcttc actgtgctcc 150 0 

tggttctgtt cttcatcttc acctacttca aagttcctga gactaaaggc cggaccttcg 1560 

atgagatcgc ttccggcttc cggcaggggg gagccagcca aagtgataag acacccgagg 162 0 

agctgttcca tcccctgggg gctgattccc aagtgtgagt cgccccagat caccagcccg 168 0 

gcctgctccc agcagcccta aggatctctc aggagcacag gcagctggat gagacttcca 1740 

aacctgacag atgtcagccg agccgggcct ggggctcctt tctccagcca gcaatgatgt 180 0 

ccagaagaat attcaggact taacggctcc aggattttaa caaaagcaag actgttgctc 1860 

aaatctattc agacaagcaa caggttttat aattttttta ttactgattt tgttattttt 192 0 

atatcagcct gagtctcctg tgcccacatc ccaggcttca ccctgaatgg ttccatgcct 19 80 

gagggtggag actaagccct gtcgagacac ttgccttctt cacccagcta atctgtaggg 2 040 

ctggacctat gtcctaagga cacactaatc gaactatgaa ctacaaagct tctatcccag 2100 

gaggtggcta tggccacccg ttctgctggc ctggatctcc ccactctagg ggtcaggctc 216 0 

cattaggatt tgccccttcc catctcttcc tacccaacca ctcaaattaa tctttcttta 2220 

cctgagacca gttgggagca ctggagtgca gggaggagag gggaagggcc agtctgggct 22 80 

gccgggttct agtctccttt gcactgaggg ccacactatt accatgagaa gagggcctgt 2 340 

gggagcctgc aaactcactg ctcaagaaga catggagact cctgccctgt tgtgtataga 2 400 

tgcaagatat ttatatatat ttttggttgt caatattaaa tacagacact aagttatagt 2460 

atatctggac aagccaactt gtaaatacac cacctcactc ctgttactta cctaaacaga 2 520 

tataaatggc tggtttttag aaacatggtt ttgaaatgct tgtggattga gggtaggagg 2 5 80 

tttggatggg agtgagacag aagtaagtgg ggttgcaacc actgcaacgg cttagacttc 2 640 

gactcaggat ccagtccctt acacgtacct ctcatcagtg tcctcttgct caaaaatctg 27 00 

tttgatccct gttacccaga gaatatatac attctttatc ttgacattca aggcatttct 2 7 60 

atcacatatt tgatagttgg tgttcaaaaa aacactagtt ttgtgccagc cgtgatgctc 2 82 0 
aggcttgaaa tcgcattatt ttgaatgtga agggaa 2 856 

<210> 210 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_006516 
<400> 210 

aaacagatat aaatggctgg tttttagaaa catggttttg aaatgcttgt ggattgaggg 60 



95 



WO 2005/086891 



PCT/US2005/007894 



<210> 211 
<211> 576 
<212> DNA 
<213> Homo sapiens 



<300> 

<308> NM_006607 



<400> 211 

atggctactc tgatctacgt tgataaggaa 
aaggatgtgc tgaagctgga gtctagacct 
gttttaacac cacgttttgg caaaacatac 
agaaaggctt tgggcactgt caacagagct 
agaaaacaaa aacagccaag cttttctgcc 
aaaagttctg ttcctgcctc agatgacgcc 
aatcttctag actttgagag ttttgacctg 
ttgagtggag tgcctctcat gatccttgat 
ctgggccccc cttcacctgt gaaaatgccc 
gtctccttca agcattctgt cgaccctgga 



attggagaac caggcacccg tgtggctgcc 60 

tcaatcaaag cattagatgg gatatctcaa 12 0 

gatgctccat cagccttacc taaagctacc 180 

acagaaaagt cagtaaagac caatggaccc 240 

aaaaagatga ccgagaagac tgttaaaaca 300 

tatccagaaa tagaaaaatt ctttcccttc 360 

cctgaagagc gccagattgc acacctcccc 42 0 

gaggagggag agcttgaaaa gctgtttcag 4 80 

tctccaccat gggaatgcaa tctgtttgca 540 
tgttga 576 



<210> 212 

<211> 60 

<212> DNA 

<213> Homo sapiens 



<300> 

<308> NM_006607 



<400> 212 

cgcctatcca gaaatagaaa aattctttcc cttcaatctt ctagactttg agagttttga 6 0 



<210> 213 

<211> 2058 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_006820 



<400> 213 

gcacgaggaa gccacagatc tcttaagaac tttctgtctc caaaccgtgg ctgctcgata 60 

aatcagacag aacagttaat cctcaattta agcctgatct aacccctaga aacagatata 12 0 

gaacaatgga agtgacaaca agattgacat ggaatgatga aaatcatctg cgcaactgct 180 

tggaaatgtt tctttgagtc ttctctataa gtctagtgtt catggaggta gcattgaaga 240 

tatggttgaa agatgcagcc gtcagggatg tactataaca atggcttaca ttgattacaa 3 00 

tatgattgta gcctttatgc ttggaaatta tattaattta cgtgaaagtt ctacagagcc 3 60 

aaatgattcc ctatggtttt cacttcaaaa gaaaaatgac accactgaaa tagaaacttt 42 0 

actcttaaat acagcaccaa aaattattga tgagcaactg gtgtgtcgtt tatcgaaaac 480 

ggatattttc attatatgtc gagataataa aatttatcta gataaaatga taacaagaaa 540 

cttgaaacta aggttttatg gccaccgtca gtatttggaa tgtgaagttt ttcgagttga 600 

aggaattaag gataacctag acgacataaa gaggataatt aaagccagag agcacagaaa 660 

taggcttcta gcagacatca gagactatag gccctatgca gacttggttt cagaaattcg 720 

tattcttttg gtgggtccag ttgggtctgg aaagtccagt tttttcaatt cagtcaagtc 7 80 

tatttttcat ggccatgtga ctggccaagc cgtagtgggg tctgatacca ccagcataac 840 

cgagcggtat aggatatatt ctgttaaaga tggaaaaaat ggaaaatctc tgccatttat 9 00 

gttgtgtgac actatggggc tagatggggc agaaggagca ggactgtgca tggatgacat 960 

tccccacatc ttaaaaggtt gtatgccaga cagatatcag tttaattccc gtaaaccaat 102 0 

tacacctgag cattctactt ttatcacctc tccatctctg aaggacagga ttcactgtgt 1080 

ggcttatgtc ttagacatca actctattga caatctctac tctaaaatgt tggcaaaagt 114 0 

gaagcaagtt cacaaagaag tattaaactg tggtatagca tatgtggcct tgcttactaa 12 00 
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agtggatgat 

ttctcaaagc 

gatggttgga 

ctctgcactg 

ggaaactggt 

ctgacatttg 

ctgcttcaga 

gttcaaaggc 

tccatatttc 

aaaaatgaga 

ctaatcacat 

tgtgagttct 

cagtgcctgg 

tcacattcac 

aaaccccatc 



tgcagtgagg 
cgggtcatga 
aattatgctt 
aggcagatgc 
gcaattgaga 
gcccagcctg 
ttttgctttt 
caaaacctga 
cgtaccattt 
tattctctaa 
tgaataatag 
ttgagaaaca 
taatgagcaa 
tttaaatttt 
tccactgc 



ttcttcaaga 
atgtccataa. 
cagatttgga 
tgcgggctgc 
gagcgttaca 
tactggtgtg 
gttcgttttg 
gaagcggtgg 
acaattcagt 
tttattcttc 
ttataaaatt 
gcgtggattt 
gcatacttgc 
tctgtatata 
2058 



caacttttta 

aatgctaggc 

actggacccc 

agatgatttt 

gccctgcatt 

ccgcaatgag 

ccttctgtcc 

gctaagatag 

ttctgtgaca 

tataacactc 

attgtataga 

tacttatctg 

cattactttt 

gaaaggaaaa 



aacatgagta 

attcctattt 

atgaaggata 

ttagaagatt 

tgagataagt 

agtcaatctc 

ttggaacagt 

gtcctactgc 

tctttttaaa 

tatatagagc 

catctgcttc 

tgtattcaca 

ccttcccact 

ctagcctggg 



gatctatgac 

ccaatatttt 

ttctcatcct 

tgcctcttga 

tgccttgatt 

tattgacagc 

catatctcaa 

aaaccacccc 

ccactggagg 

tatgtgagta 

ttaaacagat 

gagcttagca 

ctctccaaca 

caacatgatg 



1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 



<210> 214 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_006820 

tgagJtcttt: gagaaacagc gtggatttta cttatctgtg tattcacaga gcttagcaca 60 

<210> 215 

<211> 2825 

<212> DNA 

<213> Homo sapiens 



<300> 

<308> NM_006845 



<400> 215 
gcgaaattga 
atggactcgt 
aatggtttaa 
gtggaatggg 
gcaataaacc 
■ caggaaaatg 
ccaaaagaaa 
acggctcagg 
ttttcagttc 
aggatggtca 
aaccctgtga 
aacaagcgag 
tatgacagta 
ttggaatgtc 
gttaggaaac 
attcctagca 
tatctggaga 
gttgtctaca 
acttgttttg 
tctgggaaag 
ctcctgaaga 
gagatctaca 
gaggacggca 
gatgatgtca 
gccaactcca 



ggtttcttgg 
cgcttcaggc 
ttcacagtgc 
cagaaggagg 
cagaactctt 
taacaatcca 
gtcttcgaag 
agaatgacat 
ctcctgcccc 
gcgaggagat 
actcagttcg 
aagagaagaa 
gttttccaaa 
atccacttac 
gcccactgaa 
agtgtctcct 
accaagcatt 
ggttcacagc 
catatggcca 
cccagaatgc 
atcaaccctg 
atgggaagct 
agcaacaggt 
tcaagatgct 
attcctcccg 



tattgcgcgt 
ccgcctgttt 
caatgtaagg 
tgccacaaag 
acagcttctt 
gaaacaaaaa 
ccgctccact 
ggaggtggag 
cactaggcct 
ggaagagcaa 
gaggaaatca 
ggcccagaac 
ctgggaattt 
tatgactgat 
taagcaagaa 
cttggtacat 
ctgctttgac 
aaggccactg 
gacaggaagt 
atccaaaggg 
ctaccggaag 
gtttgacctg 
gcaagtggtg 
cgacatgggc 
ctcccacgcg 



ttctcttcct 
cccggtctcg 
actgtgaact 
ggcaaagaga 
cccttacatc 
cggagatccg 
cgcatgtcca 
ctgcctgcag 
tcctgccctg 
gtccattcca 
tgtcttgtga 
tctgaaatga 
gcccgaatga 
cctatcgaag 
ttggccaaga 
gaacccaagt 
tttgcatttg 
gtacagacaa 
ggcaagacac 
atctatgcca 
ttgggcctgg 
ctcaacaaga 
gggctgcagg 
agcgcctgca 
tgcttccaaa 



tgctgactct 
ctatcaagat 
tggagaaatc 
ttgattttga 
cgaaggacaa 
tcaactccaa 



ctgtctcaga 
ctgcaaactc 
cagtggctga 
tccgtggcag 
aggaagtgga 
gaatgaagag 
ttaaagaatt 
agcacagaat 
aagaaattga 
tgaaagtgga 
atgaaacagc 
tctttgaagg 
atactatggg 
tggcctcccg 
aagtctatgt 
aggccaagct 
agcatctggt 
gaacctctgg 
ttattcttcg 



ccgaatggcc 
ccaacgcagt 
ctgtgtttca 
tgatgtggct 
tctgcccttg 
aattcctgct 
gcttcgcatc 
ccgcaagcag 
aataccattg 
ctcttctgca 
aaaaatgaag 
agctcaggag 
tcgggctact 
atgtgtctgt 
tgtgatttcc 
cttaacaaag 
ttcgaatgaa 
tggaaaagca 
cggagacctc 
ggacgtcttc 
gacattcttc 
gcgcgtgctg 
taactctgct 
gcagacattt 
agctaaaggg 



60 

12 0 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 
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agaatgcatg gcaagttctc tttggtagat ctggcaggga atgagcgagg cgcagacact 1560 

tccagtgctg accggcagac ccgcatggag ggcgcagaaa tcaacaagag tctcttagcc 1620 

ctgaaggagt gcatcagggc cctgggacag aacaaggctc acaccccgtt ccgtgagagc 1680 

aagctgacac aggtgctgag ggactccttc attggggaga actctaggac ttgcatgatt 1740 

gccacgatct caccaggcat aagctcctgt gaatatactt taaacaccct gagatatgca 1800 

gacagggtca aggagctgag cccccacagt gggcccagtg gagagcagtt gattcaaatg 1860 

gaaacagaag agatggaagc ctgctctaac ggggcgctga ttccaggcaa tttatccaag 1920 

gaagaggagg aactgtcttc ccagatgtcc agctttaacg aagccatgac tcagatcagg 1980 

gagctggagg agaaggctat ggaagagctc aaggagatca tacagcaagg accagactgg 2 040 

cttgagctct ctgagatgac cgagcagcca gactatgacc tggagacctt tgtgaacaaa 2100 

gcggaatctg ctctggccca gcaagccaag catttctcag ccctgcgaga tgtcatcaag 2160 

gccttacgcc tggccatgca gctggaagag caggctagca gacaaataag cagcaagaaa 2220 

cggccccagt gacgactgca aataaaaatc tgtttggttt gacacccagc ctcttccctg 22 80 

gccctcccca gagaactttg ggtacctggt gggtctaggc agggtctgag ctgggacagg 2340 

ttctggtaaa tgccaagtat gggggcatct gggcccaggg cagctgggga gggggtcaga 2400 

gtgacatggg acactccttt tctgttcctc agttgtcgcc ctcacgagag gaaggagctc 2460 

ttagttaccc ttttgtgttg cccttctttc catcaagggg aatgttctca gcatagagct 2 52 0 

ttctccgcag catcctgcct gcgtggactg gctgctaatg gagagctccc tggggttgtc 2 580 

ctggctctgg ggagagagac ggagccttta gtacagctat ctgctggctc taaaccttct 2640 

acgcctttgg gccgagcact gaatgtcttg tactttaaaa aaatgtttct gagacctctt 27 0 0 

tctactttac tgtctcccta gagtcctaga ggatccctac tgttttctgt tttatgtgtt 2760 

tatacattgt atgtaacaat aaagagaaaa aataaaaaaa aaaaaaaaaa aaaaaaaaaa 2820 
aaaaa 2825 

<210> 216 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_006845 
<400> 216 

aaatgtttct gagacctctt tctactttac tgtctcccta gagtcctaga ggatccctac 60 

<210> 217 
<211> 823 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> NM_007019 
<400> 217 

aaacgcgggc gggcgggccc gcagtcctgc agttgcagtc gtgttctccg agttcctgtc 60 

tctctgccaa cgccgcccgg atggcttccc aaaaccgcga cccagccgcc actagcgtcg 12 0 

ccgccgcccg taaaggagct gagccgagcg ggggcgccgc ccggggtccg gtgggcaaaa 180 

ggctacagca ggagctgatg accctcatga tgtctggcga taaagggatt tctgccttcc 240 

ctgaatcaga caaccttttc aaatgggtag ggaccatcca tggagcagct ggaacagtat 3 00 

atgaagacct gaggtataag ctctcgctag agttccccag tggctaccct tacaatgcgc 3 60 

ccacagtgaa gttcctcacg ccctgctatc accccaacgt ggacacccag ggtaacatat 42 0 

gcctggacat cctgaaggaa aagtggtctg ccctgtatga tgtcaggacc attctgctct 480 

ccatccagag ccttctagga gaacccaaca ttgatagtcc cttgaacaca catgctgccg 540 

agctctggaa aaaccccaca gcttttaaga agtacctgca agaaacctac tcaaagcagg 6 00 

tcaccagcca ggagccctga cccaggctgc ccagcctgtc cttgtgtcgt ctttttaatt 660 

tttccttaga tggtctgtcc tttttgtgat ttctgtatag gactctttat cttgagctgt 72 0 

ggtatttttg ttttgttttt gtcttttaaa ttaagcctcg gttgagccct tgtatattaa 7 80 
ataaatgcat ttttgtcctt ttttagacaa aaaaaaaaaa aaa 823 

<210> 218 
<211> 60 
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<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_007019 
<400> 218 

tggaaaaacc ccacagcttt taagaagtac ctgcaagaaa cctactcaaa gcaggtcacc 60 

<210> 219 

<211> 2831 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_007183 
<400> 219 

gaattccgga caggacgtga agatagttgg gtttggaggc ggccgccagg cccaggcccg 60 

gtggacctgc cgccatgcag gacggtaact tcctgctgtc ggccctgcag cctgaggccg 12 0 

gcgtgtgctc cctggcgctg ccctctgacc tgcagctgga ccgccggggc gccgaggggc 180 

cggaggccga gcggctgcgg gcagcccgcg tccaggagca ggtccgcgcc cgcctcttgc 240 

agctgggaca gcagccgcgg cacaacgggg ccgctgagcc cgagcctgag gccgagactg 3 00 

ccagaggcac atccaggggg cagtaccaca ccctgcaggc tggcttcagc tctcgctctc 3 60 

agggcctgag tggggacaag acctcgggct tccggcccat cgccaagccg gcctacagcc 42 0 

cagcctcctg gtcctcccgc tccgccgtgg atctgagctg cagtcggagg ctgagttcag 480 

cccacaatgg gggcagcgcc tttggggccg ctgggtacgg gggtgcccag cccacccctc 540 

ccatgcccac caggcccgtg tccttccatg agcgcggtgg ggttgggagc cgggccgact 600 

atgacacact ctccctgcgc tcgctgcggc tggggcccgg gggcctggac gaccgctaca 66 0 

gcctggtgtc tgagcagctg gagcccgcgg ccacctccac ctacagggcc tttgcgtacg 72 0 

agcgccaggc cagctccagc tccagccggg caggggggct ggactggccc gaggccactg 780 

aggtttcccc gagccggacc atccgtgccc ctgccgtgcg gaccctgcag cgattccaga 840 

gcagccaccg gagccgcggg gtaggcgggg cagtgccggg ggccgtcctg gagccagtgg 900 

ctcgagcgcc atctgtgcgc agcctcagcc tcagcctggc tgactcgggc cacctgccgg 9 60 

acgtgcatgg gttcaacagc tacggtagcc accgaaccct gcagagactc agcagcggtt 102 0 

ttgatgacat tgacctgccc tcagcagtca agtacctcat ggcttcagac cccaacctgc 1080 

aggtgctggg agcggcctac atccagcaca agtgctacag cgatgcagcc gccaagaagc 1140 

aggcccgcag ccttcaggcc gtgcctaggc tggtgaagct cttcaaccac gccaaccagg 12 00 

aagtgcagcg ccatgccaca ggtgccatgc gcaacctcat ctacgacaac gctgacaaca 12 60 

agctggccct ggtggaggag aacgggatct tcgagctgct gcggacactg cgggagcagg 13 2 0 

atgatgagct tcgcaaaaat gtcacaggga tcctgtggaa cctttcatcc agcgaccacc 13 80 

tgaaggaccg cctggccaga gacacgctgg agcagctcac ggacctggtg ttgagccccc 1440 

tgtcgggggc tgggggtccc cccctcatcc agcagaacgc ctcggaggcg gagatcttct 15 00 

acaacgccac cggcttcctc aggaacctca gctcagcctc tcaggccact cgccagaaga 15 60 

tgcgggagtg ccacgggctg gtggacgccc tggtcacctc tatcaaccac gccctggacg 162 0 

cgggcaaatg cgaggacaag agcgtggaga acgcggtgtg cgtcctgcgg aacctgtcct 1680 

accgcctcta cgacgagatg ccgccgtccg cgctgcagcg gctggagggt cgcggccgca 17 40 

gggacctggc gggggcgccg ccgggagagg tcgtgggctg cttcacgccg cagagccggc 1800 

ggctgcgcga gctgcccctc gccgccgatg cgctcacctt cgcggaggtg tccaaggacc 1860 

ccaagggcct cgagtggctg tggagccccc agatcgtggg gctgtacaac cggctgctgc 1920 

agcgctgcga gctcaaccgg cacacgacgg aggcggccgc cggggcgctg cagaacatca 19 80 

cggcaggcga ccgcaggtgg gcgggggtgc tgagccgcct ggccctggag caggagcgta 2 040 

ttctgaaccc cctgctagac cgtgtcagga ccgccgacca ccaccagctg cgctcactga 2100 

ctggcctcat ccgaaacctg tctcggaacg ctaggaacaa ggacgagatg tccacgaagg 2160 

tggtgagcca cctgatcgag aagctgccag gcagcgtggg tgagaagtcg cccccagccg 2220 

aggtgctggt caacatcata gctgtgctca acaacctggt ggtggccagc cccatcgctg 2 2 80 

cccgagacct gctgtatttt gacggactcc gaaagctcat cttcatcaag aagaagcggg 2340 

acagccccga cagtgagaag tcctcccggg cagcatccag cctcctggcc aacctgtggc 2400 

agtacaacaa gctccaccgt gactttcggg cgaagggcta tcggaaggag gacttcctgg 2460 

gcccataggt gaagccttct ggaggagaag gtgacgtggc ccagcgtcca agggacagac 252 0 

tcagctccag gctgcttggc agcccagcct ggaggagaag gctaatgacg gaggggcccc 2 5 80 
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tcgctggggc ccctgtgtgc atctttgagg gtcctgggcc accaggaggg gcagggtctt 
atagctgggg acttggcttc cgcagggcag ggggtggggc agggctcaag gctgctctgg 
tgtatggggt ggtgacccag tcacattggc agaggtgggg gttggctgtg gcctggcagt 
atcttgggat agccagcact gggaataaag atggccatga acagtcacaa aaaaaaaaaa 
aaaaggaatt c 2 831 



2640 
2700 
2760 
2820 



<210> 220 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_007183 



<400> 220 



ctggcagtat cttgggatag ccagcactgg gaataaagat ggccatgaac agtcacaaaa 



60 



<210> 221 

<211> 2815 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_007267 



<400> 221 
aggaagcgga 
gagcaggatg 
tagcgggcat 
accccagggg 
ttccagcagc 
cagagagagc 
cagagcacgg 
cgaggtgcca 
cggcccctgc 
gagctggacc 
cagagcctgg 
gagaaacgca 
ggcagcggcg 
cacagcctgg 
ctgaagcgca 
accctgctgg 
caggtcgcct 
ctcacaggcg 
acgctgaacc 
ggcctgccct 
acctgcatca 
agcacctctg 
aagcgggcct 
gagtggcagc 
ctggggcttg 
gtcttctcgg 
gtcctgcccc 
gccgccctgg 
ctcatcctca 
ggcgtcctgc 
gtgatggact 
atctccgaga 
ctggagctga 
cccgccgtgc 
gccaactgcc 



ggaaggtgaa 
ggggctccgg 
tggccagaga 
accagggcca 
tcatccagga 
gggaggtgac 
ccacactccg 
tcatctccca 
tcgggaactt: 
ccacggccct 
cagtggcaca 
gcctgcgaga 
gggtctgctc 
gcctggcgct 
tcgggggcca 
ctttcaatgc 
tcccacccgc 
cgggttgctt 
agccgtgtgg 
acaacatgcc 
ccctggtgta 
gcatccacgc 
cccgcctcca 
tgcggcacag 
tgtggctgct 
agttcatgat 
tggtggttgg 
agccgcatga 
agctggccat 
agggccagtg 
tcgtcctcat 
agaagctgaa 
tttatgggca 
agatcatcaa 
aggcgccgcg 



gtaggaccga 
aggtggcgcc 
catggcccag 
gggccccagc 
gcagagccag 
aggaagtagc 
catcctggcc 
gtactacaac 
tgtccgctcc 
ggaggaggag 
gcgggaccac 
gaagagcagg 
ctgctgtggc 
gctctccgcc 
gttcggctcc 
cctcctgctg 
cctgccgggc 
cacccacacc 
cagccccctg 
cctggcctac 
cagcatggct 
catcaccgtc 
gcaggacaat 
ccccaggagc 
gtgtctgggg 
ccagagtcca 
cctcctcaac 
ctccccggta 
cc tggggaca 
ctgggaggat 
gttgctggac 
gaggaggcgg 
gactctgacc 
gctgctgctc 
ccggccctgg 



attcctgtgc 
caggctctga 
ccactggcct 
ccctatgatg 
tgcacggccc 
cagcagacac 
agcatgccca 
cgcacggtgc 
gcctggccca 
gagaagcaga 
atgcttcgcg 
accccgaggg 
cggctcagat 
ctgcaggccc 
agcgtgctct 
ctgctgctgg 
cctgcccccg 
gtcatgtact 
gatggcagcc 
ctctccactg 
cactctttcg 
ttctgctcct 
attcgcaccc 
gtgtgcggga 
accgcgctgg 
gaggctgctg 
ctgggggccc 
ctggaggtgt 
ctgtgctacc 
tttgtgggcc 
acgctttttg 
aagccggagt 
tggctggggg 
gtcttctatg 
ctggcctcac 



cgaagaggcc 
gctaccctag 
tcatcctcga 
aaagcgaagt 
aggaggggct 
tctggcggcc 
gccgcaccat 
agcttcggtg 
gcctccgcct 
gcctcctggt 
ggatgccctt 
ggaagtggag 
atgcctgcgt 
tgatgccgtg 
cctacttcct 



tggccttcat 
tctgcacagg 
acggccacta 
agtgcacacc 
tgggcgtgag 
gggagagcta 
gggactacaa 
ggctgaagga 
ggctgcggca 
gctgcgccgt 
gccaggaggc 
cctacctgtg 
acgtggccat 
actggctggg 
aggagctgta 
gggaactggt 
ttgacattgc 
tgctcttctc 
tcaagaagac 
acatgagcac 



tgcagtggga 
gtctgcagac 
tgtccctgag 
gcacgactcc 
ggagctgcag 
cgagggcacc 
tggccgcagc 
caggagcagc 
gtacgacctg 
gaaggagttc 
aagcctggct 
gggccagccg 
gctggccttg 
gcgctacgcc 
ctttctcaag 
catgggccct 
cctggagctc 
cagtaacgcc 
cagggtgggt 
cttctttatc 
ccgggtgggc 
ggtgacgcag 
gctgctggcc 
ggcggctgtg 
ggccgtccac 
tgtgctgctg 
ccgtgtcctg 
ctgcaggaac 
ccgcagggtg 
ccggttcctg 
gtggaggatt 
ccggaatgtc 
gcccctcctc 
cagccttctg 
cgtcttcctc 
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acgctgctct 

caggtgaagc 

ggcagggtgt 

gtgcaccggt 

gccgtgatct 

aaggagcaga 

atctacgaga 

gcaccccctg 

gggcccgccc 

ggacaaggaa 

gggtgggtga 

attctgatta 



gcttccccgc 

cctcgagcac 

gggtgcgcca 

acctgatgga 

acctcaacat 

tcagcaatga 

ggaaggagag 

ccctgctcac 

agcaccctga 

gacagtttcg 

atccggcttg 

aaggggtttt 



cttcctgggc 

ctgcggcccc 

cctggaggcg 

aaacaccttc 

ccaggtggtg 

gggtgaggac 

ggaggagagg 

agatgaacag 

gaccacactg 

cctctcgaaa 

ggcatcccca 

gcaaatggga 



gccgctgtct 

ttccggaccc 

gcaggcccca 

tttgtcttcc 

cggggccagc 

aaaatcttct 

agcagggttg 

gatgcctagg 

ttgcctccca 

gccgcagctg 

atgaactctg 

aaaaaaaaaa 



tcctctgcta 

tggacaccat 

gggtctcctg 

tggtgtcagc 

gcaaggtcat 

taatcaacaa 

ggacaaccga 

gggacggcga 

gtgaccctgc 

cgcctaggct 

ccctgcctgg 

aaaaaaaaaa 



cgccgtctgg 
gtacgaggcc 
gctgccctgg 
cctgctgctg 
ctgcctgctc 
gcttcactcc 
ggaggctgcg 
tgggcctcac 
tgggacacca 
ggagctggaa 
gactctattt 
aaaaa 2 815 



2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 



<210> 222 
<211> 60 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> NM__007267 



<400> 222 
ggtgaggaca 



aaatcttctt aatcaacaag cttcactcca tctacgagag gaaggagagg 



<210> 223 

<211> 1893 

<212> DNA 

<213> Homo sapiens 



60 



<300> 

<308> NMJD07274 



<400> 223 

atttaccgcc 

ccagacgctt 

tccccgcacc 

ccccacgccg 

gcggggctcg 

tcgcccccgg 

tttatggcgc 

ctgcagccgc 

acgccgtccg 

aatgtccacg 

cggcattgca 

accgacttcc 

tacacctcca 

ggtgccaaaa 

gtggacaagg 

ggccggaagc 



gcgcggagag 
ccctcggggc 
ggcgcggccc 
ccagaatcgt 
gggtggggcc 
cgcgcaggcc 
ggcccgggct 
ccgccgcatc 
ccatccagat 
gggggaccat 

acagccagaa 
tgtctcccat 
agcactctgt 
agctgaccaa 
tcctcgaggt 
ggtatgaagc 



tgagggccca 

tgccaccggg 

aaccgccacc 

acgcccgcgc 

ggcgcggggg 

ccgcctcacc 

cattcattcc 

cgccgccgca 

ctgccggatc 

cctgaagatg 

cggggagcgc 

gtgcatcggt 

ggaggtgcag 

taaggccacc 

gcctcctgtt 

ccagaagctg 



agtccgccct 

tcgggcgcgg 

gaaccttctg 

gagctctctg 

cggggtcggc 

agccccatcg 

gcgccgggcc 

gcccccagca 

atgcggccag 

atcgaggagg 

tgtgtggccg 

gaggtggcgc 

gtcaacgtga 

ctgtggtatg 

gtgtattccc 

gagcgcatgg 



gctccgccac 

ctgccgcggc 

gaagcggcgg 

cagccttggc 

gcggggaggc 

ctccacctct 

tgccagacac 

tgtcgggccc 

atgatgccaa 

caggcgccat 

ccctggctcg 

atgtcagcgc 

tgtccgaaaa 

tgcccctgtc 

ggcaggagca 

agaccaagtg 



ttaggccgcc 

tagcgggcct 

ctgcctgggc 

ggcctgggag 

cgcgttcgat 

gccctccccc 

ctgcgccctt 

agacgtcgag 

cgtggccggc 

catcagcacc 

tgtcgagcgc 

ggagatcacc 

catcctcaca 

gctgaagaat 

ggaggaggag 

gaggaacggg 



gacatcgtcc 
ttgatccacc 
accatgaagc 
atcgtcacag 
atcaccatct 
gtggacgccg 
ttcacctacg 
gagaccgagg 
gcgaagcgac 
ctcgagtagc 
aaaacccaat 



agccagtcct 
tggtggggcc 
tcatggatga 
cttccgtgga 
cgggacgcat 
accctgttgt 
tgtcgctgag 
acgagaagaa 
agggccacgc 
catggcaacg 
tcacattgag 



caacccagag 
ttcagactgc 
ggtcgccggg 
cgccattaat 
gaccttcacg 
ggacagctct 
ccaggaaggc 
gcgctttgag 
ggagcctcag 
ggcccagtgt 
agctggtgtt 



ccgaacactg 
accctgcacg 
atcgtggctg 
tttcatgaca 
agcaataagt 
cagaagcgct 
aggtcgctgc 
gaaggcaaag 
ccctagactc 
ccagtcactt 
gtctgaagtt 



tcagctacag 
gctttgtgca 
cacgccactg 
agatcagaaa 
ccatggagat 
accgggccgc 
ctgtgcccca 
ggcggtacct 
cctcctcctg 
agaagttccc 
ttcgtatcac 



ccagtccagc 

cggaggtgtg 

caagaccaac 

aggctgcgtc 

cgaggtgttg 

cagtgccttc 

gctggtgccc 

gcagatgaag 

ccactggtgc 

cccttggcca 

agtgttaacc 
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tgtactctct cctgcaaacc tacacaccaa agctttattt atatcattcc agtatcaatg 1680 

ctacacagtg ttgtcccgag cgccgggagg cgttgggcag aaaccctcgg gaatgcttcc 1740 

gagcacgctg tagggtatgg gaagaaccca gcaccactaa taaagctgct gcttggctgg 1800 

aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 186 0 
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaa 1893 

<210> 224 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM__007274 



<400> 224 

acctacacac caaagcttta tttatatcat tccagtatca atgctacaca gtgttgtccc 



60 



<210> 225 

<211> 4157 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_007315 



<400> 225 

agcggggcgg 

gagtctgcgg 

cagaggcgac 

ggatcctgcg 

aacgttcgca 

ctgctccttt 

tggtacgaac 

gacagttttc 

gagcacgctg 

ctggatgatc 

aggaaaagca 

atcatttaca 

caggctcagt 

agtaaagtca 

gaagatttac 

accaatggtg 

ttaatgcttg 

actgaactta 

cagagcgcct 

actatagttg 

gaacagaaat 

cgcaccttca 

tgcatgccaa 

aagttgagac 

tttgataaag 

ggcacgcaca 

tttcggcacc 

ctcatcgtta 

ttggtaattg 

ctcccgagcg 

ctgtccttct 

tggcagtttt 

gagaagcfctc 

aaggaaaata 

ctcattaaaa 



ggcgccagcg 
aggggctcgg 
ccagcgcgct 
cgcagaaaag 
ctctgtgtat 
ggttgaatcc 
ttcagcagct 
ccatggaaat 
ccaatgatgt 
aatatagtcg 
agcgtaatct 
gctgtctgaa 
cggggaatat 
gaaatgtgaa 
aagatgaata 
tggcaaagag 
acaataagag 
cccagaatgc 
gtattggggg 
cggagagtct 
acacctacga 
gtcttttcca 
cgcaccctca 
tgttggtgaa 
atgtgaatga 
caaaagtgat 
tgcaattgaa 
ctgaagagct 
acctcgagac 
gttgggcctc 
tcctgactcc 
cttctgtcac 
ttggtcctaa 
taaatgataa 
aacacctgct 



ctgccttttc 

ctgcaccggg 

cgggagaggc 

tttcatttgc 

ataacctcga ' 

ccaggccctt 

tgactcaaaa 

cagacagtac 

ttcatttgcc 

cttttctttg 

tcaggataat 

ggaagaaagg 

tcagagcaca 

ggacaaggtt 

tgacttcaaa 

tgatcagaaa 

aaaggaagta 

cctgattaat 

gccgcccaat 

gcagcaagtt 

acatgaccct 

gcagctcatt 

gaggccgctg 

attgcaagag 

gagaaataca 

gaacatggag 

agaacagaaa 

tcactccctt 

gacctctctg 

catcctttgg 

accatgtgca 

caaaagaggt 

cgccagcccc 

aaattttccc 

ccctctctgg 



tcctgccggg 
gggatcgcgc 
tgcaccgccg 
tgtatgccat 
cagtcttggc 
gttggggcac 
ttcctggagc 
ctggcacagt 
accatccgtt 
gagaataact 
tttcaggaag 
aaaattctgg 
gtgatgttag 
atgtgtatag 
tgcaaaacct 
caagaacagc 
gttcacaaaa 
gatgaactag 
gcttgcttgg 
cggcagcagc 
atcacaaaaa 
cagagctcgt 
gtcttgaaga 
ctgaattata 
gtaaaaggat 
gagtccacca 
aatgctggca 
agttttgaaa 
cccgttgtgg 
tacaacatgc 
cgatgggctc 
ctcaatgtgg 
gatggtctca 
ttctggcttt 
aatgatgggt 



tagtttcgct 
ctggcagacc 
cgcccccgcc 
cctcgagagc 
acctaacgtg 
aaggtggcag 
aggttcacca 
ggttagaaaa 
ttcatgacct 
tcttgctaca 
acccaatcca 
aaaacgccca 
acaaacagaa 
agcatgaaat 
tgcagaacag 
tgttactcaa 
taatagagtt 
tggagtggaa 
atcagctgca 
ttaaaaagtt 
acaaacaagt 
ttgtggtgga 
caggggtcca 
atttgaaagt 
ttaggaagtt 
atggcagtct 
ccagaacgaa 
cccaattgtg 
tgatctccaa 
tggtggcgga 
agctttcaga 
accagctgaa 
ttccgtggac 
ggattgaaag 
gcatcatggg 



ttcctgcgca 
ccagaccgag 
tagcccttcc 
tgtctaggtt 
ctgtgcgtag 
gatgtctcag 
gctttatgat 
gcaagactgg 
cctgtcacag 
gcataacata 
gatgtctatg 
gagatttaat 
agagcttgac 
caagagcctg 
agaacacgag 
gaagatgtat 
gctgaatgtc 
gcggagacag 
gaactggttc 
ggaggaattg 
gttatgggac 
aagacagccc 
gttcactgtg 
caaagtctta 
caacattttg 
ggcggctgaa 
tgagggtcct 
ccagcctggt 
cgtcagccag 
acccaggaat 
agtgctgagt 
catgttggga 
gaggttttgt 
catcctagaa 
cttcatcagc 
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aaggagcgag 
agtgagagct 
ggcgaacctg 
ttccctgaca 
ctgaagtatc 
cc aaaggaag 
actgagttga 
ctccccatgt 
gacagtatga 
tccttctcat 
aaatgaaaga 
aactcagaaa 
aggcattctc 
aggacgagaa 
gcataaagtc 
ctgcttgacg 
aagtgtaact 
tacaattata 
gtattttatt 
gaagagaata 
aggatgtaca 
tttcatcttg 
aaccactcat 
atgtttctta 
ccacaaaatt 
gataatatga 
catatgtttc 
tactttttcc 
ctttttcctt 
ttccctttta 
aactatatta 
ttgttttaaa 
tttccgtggc 
aacctaagtt 
aaacaatatt 



agcgtgccct gttgaaggac 
cccgggaagg ggccatcaca 
acttccatgc ggttgaaccc 
tcattcgcaa ttacaaagtc 
tgtatccaaa tattgacaaa 
caccagagcc aatggaactt 
tttctgtghc tgaagttcac 
ctcctgagga gtttgacgag 
tgaacacagt atagagcatg 
ctgtgattcc ctcctgctac 
aaggccagca aattcgctgc 
catcagttac tctgaagggc 
tgaagagtgg gtttcacaag 
tgagggtcct ttgggaaagg 
agtgcccaac tgttataggt 
taggaacggt aaatttctgt 
ggcagttttc cattggttta 
tcagtcctct ttcaaaggta 
acatctttca cattggctat 
ttagctttac tgtttgttat 
tttccaaatt cacaagttgt 
gtcacataca attattttta 
tcaaaagttg aaattaacca 
aatgggctac tttgtccttt 
gggaaaggag tagaaaaagc 
gaatcagatc atttcaaaac 
gctgatatat gtgtttttca 
agacactttt ttgagtggat 
ccttatcact gacacaaaaa 
catactgctg tctatgtggc 
tcatgcaaat gctgtattct 
attaaagcta aagtatctgt 
actgcataca atctgaggcc 
tcagttgatt ttacaattga 



cagcagccgg 

ttcacatggg 

tacacgaaga 

atggctgctg 

gaccatgcct 

gatggcccta 

ccttctagac 

gtgtctcgga 

aatttttttc 

tctgttcctt 

aacctgttga 

atcatgcatc 

tgaaaaacat 

agaagttaag 

tgttggataa 

gggagaattc 

cctgtgaaat 

gccatcatgg 

ttaaagacaa 

ggcttaatga 

gtttgatatc 

cagttctccc 

tagatgtaga 

ttgttattag 

agtaactgac 

tcatttccta 

catttgcgaa 

gatgtttcgt 

gtagattaag 

tgtatcttgt 

tctttggtgg 

attgcattaa 

tcctctctca 

aatgactaaa 



ggaccttcct 
tggagcggtc 
aagaactttc 
agaatattcc 
ttggaaagta 
aaggaactgg 
ttcagaccac 
tagtgggctc 
atcttctctg 
cacatcctgt 
tagcaagtga 
ttactgaagg 
ccagatacac 
caacatctag 
atcagtggtt 
ttacatgttt 
agttcaaagc 
atctggtagg 
agacaaattc 
cactagctaa 
caaagctgaa 
aagggagtta 
taaactcaga 
ggtggtattt 
aacttgaata 
tgtaactgca 
tggttccatt 
gaagtatact 
agatgggttt 
ttttccacta 
agataaagat 
atataatatg 
gtttttatat 
aaacaaagaa 



gctgcggttc 
ccagaacgga 
tgctgttact 
tgagaatccc 
ttactccagg 
atatatcaag 
agacaacctg 
tgtagaattc 
gcgacagttt 
gtttctaggg 
atttttctct 
taaaattgaa 
ccaaagtatc 
caaatgttat 
atttagggaa 
tctttgcttt 
caagtttata 
gggaaaatgt 
tgtttcttga 
tatcaataga 
tacattctgc 
ggctattcac 
aatttaattc 
agtctattag 
atacaccaga 
ttgagaactg 
ctctctcctg 
gtatttttac 
gacaaggttc 
ctgctaccac 
ttcttgagtt 
cacacagtgc 
agatggcgag 
gacaacatta 



2160 
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3600 

3660 
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3960 
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4140 



gtttcta 4157 



<210> 226 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_007315 
<400> 226 

atcagatcat ttcaaaactc atttcctatg taactgcatt gagaactgca tatgtttcgc 60 

<210> 227 

<211> 1696 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_009587 



<400> 227 

caaaggactt 

ccttcagcgg 

aaggaggtct 

gaaccaggtt 

tcaaccctcg 



cctagtgggt 
ttcccaggct 
ccaggacgga 
tgctgtgaac 
gtttgaagat 



gtgaaaggca 
ccctacctga 
cttcagatca 
tttcagactg 
ggagggtacg 



gcggtggcca 
gfcccagctgt 
ctgtcaatgg 
gcttcagtgg 
tggtgtgcaa 



cagaggcggc 
ccccttttct 
gaccgttctc 
aaatgacatt 
cacgaggcag 



ggagagatgg 6 0 

gggactattc 12 0 

agctccagtg 180 

gccttccact 240 

aac ggaagc t 3 00 
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gggggcccga ggagaggaag acacacatgc ctttccagaa ggggatgccc tttgacctct 3 60 

gcttcctggt gcagagctca gatttcaagg tgatggtgaa cgggatcctc ttcgtgcagt 42 0 

acttccaccg cgtgcccttc caccgtgtgg acaccatctc cgtcaatggc tctgtgcagc 480 

tgtcctacat cagcttccag aacccccgca cagtccctgt tcagcctgcc ttctccacgg 540 

tgccgttctc ccagcctgtc tgtttcccac ccaggcccag ggggcgcaga caaaaacctc 600 

ccggcgtgtg gcctgccaac ccggctccca ttacccagac agtcatccac acagtgcaga 660 

gcgcccctgg acagatgttc tctactcccg ccatcccacc tatgatgtac ccccaccccg 72 0 

cctatccgat gcctttcatc accaccattc tgggagggct gtacccatcc aagtccatcc 7 80 

tcctgtcagg cactgtcctg cccagtgctc agaggttcca catcaacctg tgctctggga 840 

accacatcgc cttccacctg aacccccgtt ttgatgagaa tgctgtggtc cgcaacaccc 900 

agatcgacaa ctcctggggg tctgaggagc gaagtctgcc ccgaaaaatg cccttcgtcc 9 60 

gtggccagag cttctcagtg tggatcttgt gtgaagctca ctgcctcaag gtggccgtgg 102 0 

atggtcagca cctgtttgaa tactaccatc gcctgaggaa cctgcccacc atcaacagac 1080 

tggaagtggg gggcgacatc cagctgaccc atgtgcagac ataggcggct tcctggccct 1140 

ggggccgggg gctggggtgt ggggcagfcct gggtcctctc atcatcccca cttcccaggc 12 0 0 

ccagcctttc caaccctgcc tgggatctgg gctttaatgc agaggccatg tccttgtctg 12 6 0 

gtcctgcttc tggctacagc caccctggaa cggagaaggc agctgacggg gattgccttc 132 0 

ctcagccgca gcagcacctg gggctccagc tgctggaatc ctaccatccc aggaggcagg 13 80 

cacagccagg gagaggggag gagtgggcag tgaagatgaa gccccatgct cagtcccctc 144 0 

ccatccccca cgcagctcca ccccagtccc aagccaccag ctgtctgctc ctggtgggag 1500 

gtggcctcct cagcccctcc tctctgacct ttaacctcac tctcaccttg caccgtgcac 1560 

caacccttca cccctcctgg aaagcaggcc tgatggcttc ccactggcct ccaccacctg 162 0 

accagagtgt tctcttcaga ggactggctc ctttcccagt gtccttaaaa taaagaaatg 1680 
aaaatgcttg ttggca 1696 

<210> 228 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_009587 
<400> 228 

cagaggactg gctcctttcc cagtgtcctt aaaataaaga aatgaaaatg cttgttggca 6 0 

<210> 229 
<211> 6552 
<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_012291 
<400> 229 

atgaggagct tcaaaagagt caactttggg actctgctaa gcagccagaa ggaggctgaa 60 

gagttgctgc ccgacttgaa ggagttcctg tccaaccctc cagctggttt tcccagcagc 12 0 

cgatctgatg ctgagaggag acaagcttgt gatgccatcc tgagggcttg caaccagcag 180 

ctgactgcta agctagcttg ccctaggcat ctggggagcc tgctggagct ggcagagctg 2 40 

gcctgtgatg gctacttagt gtctacccca cagcgtcctc ccctctacct ggaacgaatt 3 00 

ctctttgtct tactgcggaa tgctgctgca caaggaagcc cagaggccac actccgcctt 3 60 

gctcagcccc tccatgcctg cttggtgcag tgctctcgcg aggctgctcc ccaggactat 42 0 

gaggccgtgg ctcggggcag cttttctctg ctttggaagg gggcagaagc cctgttggaa 480 

cggcgagctg catttgcagc tcggctgaag gccttgagct tcctagtact cttggaggat 540 

gaaagtaccc cttgtgaggt tcctcacttt gcttctccaa cagcctgtcg agcggtagct 600 

gcccatcagc tatttgatgc cagtggccat ggtctaaatg aagcagatgc tgatttccta 660 

gatgacctgc tctccaggca cgtgatcaga gccttggtgg gtgagagagg gagctcttct 72 0 

gggcttcttt ctccccagag ggccctctgc ctcttggagc tcaccttgga acactgccgt 7 80 

cgcttttgct ggagccgcca ccatgacaaa gccatcagcg cagtggagaa ggctcacagt 840 

tacctaagga acaccaatct agcccctagc cttcagcfcat gtcagctggg ggttaagctg 900 
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ctgcaggttg 
gtcctgagca 
cagttcttcc 
ctgagcctct 
g-gtgtgtatg 
cttcacctct 
gctgacctga 
ttagagggcc 
agtaatttgg 
gagccgctct 
gagaagttgc 
gcccagggct 
cacatggctg 
gacaaggagc 
ctggccctcc 
caggaacgct 
ggggcctggg 
tttacgcagc 
gactctgtga 
ttgctgtggc 
cggagagccc 
gataaactcc 
gctgctcagt 
gggcaggccc 
gccctctacc 
attgtctctg 
cagctcctcc 
tcgagcctga 
gatctgcttc 
ctgctgtctg 



gggaggaagg 
agagtatgga 
tttcaggcct 
ttgcttttct 

ggggctcctc 

acactgtggt 
cccaactagt 
tgtcgggcca 
cctacagctt 
gtcagcacct 
acaggtgctt 
gcaagatggt 
agccagtcac 
tacagctaaa 
tgctgaggga 
tcaacatcat 
cacgagccac 
agaccaactg 
ggcctgaggc 
tttacatctg 
aggcccctgg 
aggaagatcg 
ccaaatgcct 
cagctgtacg 
agctggtggc 
agagactgaa 
tgaccctcgg 
agcatctcga 
gaagtcaact 
tgcttcggga 



acctcaggca 
ggcaccatca 
ggaacgaggc 
tggagggtac 
caagcaacag 
ggtttatgac 
ggacagttgt 
agagctgacg 
ctatagtcac 
gggtttggtg 
ccggctacaa 
gatbttgtgg 
tttctgggtt 
gactctgcga 
ggagctgcag 
ctgtgacctc 
ccacctggta 
ctctgctctg 
ccaggccaga 
tactctggaa 
taacttggag 
tttcctatac 
ggaccaagcc 
gtgtctccag 
aaagcccatg 
ggaccactcg 
ctgtcccagc 
tcagactact 
ctactggact 
tcctgccctc 



gtggccaagc 
cccccacttc 
accaagaggc 
tgctctcttc 
cagtcttttc 
tttgcccaag 
aaatctaccg 
gaccacatgg 
aagctctatg 
aagccaggca 
gtagagagtt 
ctggcagccc 
cgggtcaaga 
gacagcctca 
gcctacaagg 
ctggagctga 
gaactggctc 
gatgctatcc 
gatcagcttc 
gccaaaatac 
gaatttgaag 
agtaacattg 
ctggccctgt 
cagacagcag 
caggctctgg 
aaggcagctg 
tatgcccagt 
gacacatacc 
caccagaagg 
cagaagtcct 



ttctgatcaa 
gggcattgta 
gctatagact 
tgcagcagct 
ttcagatgta 
gctgtcagat 
ttgtctggat 
ggatgaccgc 
ccgaggcctg 
cttatcccga 
tgaagaaact 
tgcaaccctg 
tggatgcggc 
gtggctggga 
cggtgcgggc 
gccccgagga 
aggtgctctg 
gggaagccct 
tggacgataa 
aggaaggtat 
tcaatgacct 
ccttcaacct 
ggaaggagct 
cctcactgca 
aggtcctcct 
gctcctcctg 
tacacctgga 
tgctcctttc 
tgaccaaggg 
ccaaggcttg 



ggcatcagct 
tgagagctgc 
tgatgccatt 
gcgggatgat 
ctttcaggga 
agttgatttg 
gctggaggcc 
ttcttacacc 
tgccatctct 
ggtgcctcct 
gggtaaacag 
tagccctgaa 
cagggctgga 
cccggagacc 
cgacactgga 
gacaccagcc 
ctaccacgac 
gcagcttctg 
agcacaggcc 
cgagcgggat 
gaactatgaa 
ggctgcagat 
gcttacaaag 
gatcctagca 
gctgctacgg 
ccacatcacc 
agaggcagca 
cctgacctgt 
tgtctctctg 
gtacttgctg 



960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 



cgtgtccagg 
cactccctgt 
gactcccata 
actcagaaag 
tggcaggttc 
ctgggtagtg 
ctgcagatac 
cgcaatgaca 
tgcacagagt 
aaggggaagc 
ctaagaggcc 
ccttctacaa 
tcccattcac 
ctgcgctggg 
ctggatctgc 
gctctccaag 
gagatcttgg 
gagagcctgc 
ctagagccct 
agctgctgta 
agtgtccctg 
caaaagttag 
agaggactgc 
cccttcacgg 
cccagggtac 
ttggaagacc 
gcttcccggg 
gccccaggta 



tcctgcagct 
gggagcagct 
agctcctccg 
cagctgtgga 
tttcagaggt 
tgagtgaagc 
cacgccagtg 
ttgatctctg 
ttggtggggt 
agcaggccca 
ctgctctaga 
actcctcccc 
ccacctgtga 
tattggtcac 
tgcaggtcgt 
cttccctgaa 
ctcaagcata 
agaaggttct 
ggcgagccag 
ctacccaact 
gctcagagcc 
cctctgctcc 
cctgcacacc 
tgtttgagga 
aacagagagt 
ctgtctcagc 
gccgggggcg 
gtgcccctgg 



ggtggcagct 
ctgtgcccaa 
aagcatcatc 
gacatcgttt 
gctgagctgc 
caaggccttt 
tgccctgttc 
tcagtcggac 
gactcagcac 
ggtcccctgt 
gctggtggcc 
agtcttgaaa 
ctgctcgctc 
ggcaggggtg 
gctgaagggc 
tcataaaaca 
cacactgttg 
acagtcaggg 
cctgctcttg 
ttttgcaagc 
ctctaagact 
cctgcgcctc 
taaaccccca 
agtctgccct 
ccagacgcgc 
tgaggcctgg 
agcaaggaag 
gaaccctggc 



taccttagcc 
ggctggcaga 
ctcctgctga 
ttggactatg 
tcagagaagc 
tgcttggagg 
ctggtgctga 
ctgcagcagg 
ctggactctg 
cctccacagc 
actgtggcca 
accaagcccc 
tgcgccagcc 
aggctggcca 
tgtcctgaag 
cccccctcct 
gcactggagg 
ctgaagtttg 
atttgggccc 
tcctggggct 
cagggccaaa 
aataatacct 
gaccggatca 
acagagagca 
ctcaaggtga 
ctggcagagg 
ggcctgagcc 
ctgaatggca 



tcccgtcaaa 
cacctgagat 
tgggcagtga 
gtgaaaatct 
tggtctgcca 
ccctaaaact 
agggcgagct 
ttctgttctt 
tgaagaaggt 
tcccagagga 
aggagcctgg 
agcccatacc 
ctgtcctcac 
tgggccacca 
ccgctgagcg 
tggttccaag 
gcctgaacca 
tagcagcacg 
tcacaaaact 
ggcagccacc 
aacgttctgg 
ctcagaaagg 
ggcaagctgg 
agcctgaagt 
acttcagtga 
agcctaagag 
taaagacgga 
ggagccggag 



caacctctca 
agctctcata 
cattctctca 
ggtacaaaaa 
cctgggccgc 
tacaacaaag 
ggagctggcc 
gcttgagtct 
ccacctgcag 
ggagctcttc 
ccccatagca 
caacttcctg 
agcagtctgt 
agcccagggt 
cctcacccaa 
cctcttggat 
gccatcaaac 
gataccccac 
aggtggcctc 
attaataaaa 
acgagggcgc 
tctggaaggt 
ccctcatgtc 
accccaggcc 
tgacagtgac 
acggggcact 
tgccgtggtt 
ggccaagaag 



2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 



gtggcatcaa gacattgtga ggagcggcgt ccccagaggg ccagtgacca ggccaggcct 4440 
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ggccctgaga tcatgaggac catccctgag gaagaactga ctgacaactg gagaaaaatg 4500 

agctttgaga tcctcagggg ctctgacggg gaagactcag cctcaggtgg gaagactcca 45 60 

gctccgggcc ctgaggcagc ttctggagaa tgggagctgc tgaggctgga ttccagcaag 462 0 

aagaagctgc ccagcccatg cccagacaag gagagtgaca aggaccttgg tcctcggctc 4680 

cagctcccct cagcccccgt agccactggt ctttctaccc tggactccat ctgtgactcc 4740 

ctgagtgttg ctttccgggg cattagtcac tgtcctccta gtgggctcta tgcccacctc 480 0 

tgccgcttcc tggccttgtg cctgggccac cgggatcctt atgccactgc tttccttgtc 486 0 

accgagtctg tctccatcac ctgtcgccac cagctgctca cccacctcca cagacagctc 492 0 

agcaaggccc agaagcaccg aggatcactt gaaatagcag accagctgca ggggctgagc 4980 

cttcaggaga tgcctggaga tgtccccctg gcccgcatcc agcgcctctt ttccttcagg 5040 

gctttggaat ctggccactt cccccagcct gaaaaggaga gtttccagga gcgcctggct 510 0 

ctgatcccca gtggggtgac tgtgtgtgtg ttggccctgg ccaccctcca gcccggaacc 516 0 

gtgggcaaca ccctcctgct gacccggctg gaaaaggaca gtcccccagt cagtgtgcag 522 0 

attcccactg gccagaacaa gcttcatctg cgttcagtcc tgaatgagtt tgatgccatc 5280 

cagaaggcac agaaagagaa cagcagctgt actgacaagc gagaatggtg gacagggcgg 53 4 0 

ctggcactgg accacaggat ggaggttctc atcgcttccc tagagaagtc tgtgctgggc 5400 

tgctggaagg ggctgctgct gccgtccagt gaggagcccg gccctgccca ggaggcctcc 5460 

cgcctacagg agctgctaca ggactgtggc tggaaatatc ctgaccgcac tctgctgaaa 552 0 

atcatgctca gtggtgccgg tgccctcacc cctcaggaca ttcaggccct ggcctacggg 5 580 

ctgtgcccaa cccagccaga gcgagcccag gagctcctga atgaggcagt aggacgtcta 5 640 

cagggcctga cagtaccaag caatagccac cttgtcttgg tcctagacaa ggacttgcag 5700 

aagctgccgt gggaaagcat gcccagcctc caagcactgc ctgtcacccg gctgccctcc 5760 

ttccgcttcc tactcagcta ctccatcatc aaagagtatg gggcctcgcc agtgctgagt 5 82 0 

caaggggtgg atccacgaag taccttctat gtcctgaacc ctcacaataa cctgtcaagc 5 880 

acagaggagc aatttcgagc caatttcagc agtgaagctg gctggagagg agtggttggg 5940 

gaggtgccaa gacctgaaca ggtgcaggaa gccctgacaa agcatgattt gtatatctat 6000 

gcagggcatg gggctggtgc ccgcttcctt gatgggcagg ctgtcctgcg gctgagctgt 6 060 

cgggcagtgg ccctgctgtt tggctgtagc agtgcggccc tggctgtgca tggaaacctg 6120 

gagggggctg gcatcgtgct caagtacatc atggctggtt gccccttgtt tctgggtaat 6180 

ctctgggatg tgactgaccg cgacattgac cgctacacgg aagctctgct gcaaggctgg 6240 

cttggagcag gcccaggggc cccccttctc tactatgtaa accaggcccg ccaagctccc 63 00 

cgactcaagt atcttattgg ggctgcacct atagcctatg gcttgcctgt ctctctgcgg 63 60 

taaccccatg gagctgtctt attgatgcta gaagcctcat aactgttcta cctccaaggt 6420 

tagatttaat ccttaggata actcttttaa agtgattttc cccagtgttt tatatgaaac 6480 

atttcctttt gatttaacct cagtataata aagatacatc atttaaaccc tgaaaaaaaa 6540 
aaaaaaaaaa aa 6552 

<210> 230 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_012291 
<400> 230 

agcctcataa ctgttctacc tccaaggtta gatttaatcc ttaggataac tcttttaaag 60 



<210> 231 

<211> 6317 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM„013261 
<400> 231 

tagtaagaca ggtgccttca gttcactctc agtaaggggc tggttgcctg catgagtgtg 60 
tgctctgtgt cactgtggat tggagttgaa aaagcttgac tggcgtcatt caggagctgg 12 0 
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atggcgtggg acatgtgcaa ccaggactct gagtctgtat ggagtgacat cgagtgtgct 180 

gctctggttg gtgaagacca gcctctttgc ccagatcttc ctgaacttga tctttctgaa 240 

ctagatgtga acgacttgga tacagacagc tttctgggtg gactcaagtg gtgcagtgac 300 

caatcagaaa taatatccaa tcagtacaac aatgagcctt caaacatatt tgagaagata 3 60 

gatgaagaga atgaggcaaa cttgctagca gtcctcacag agacactaga cagtctccct 42 0 

gtggatgaag acggattgcc ctcatttgat gcgctgacag atggagacgt gaccactgac 480 

aatgaggcta gtccttcctc catgcctgac ggcacccctc caccccagga ggcagaagag 540 

ccgtctctac ttaagaagct cttactggca ccagccaaca ctcagctaag ttataatgaa 600 

tgcagtggtc tcagtaccca gaaccatgca aatcacaatc acaggatcag aacaaaccct 660 

gcaattgtta agactgagaa ttcatggagc aataaagcga agagtatttg tcaacagcaa 72 0 

aagccacaaa gacgtccctg ctcggagctt ctcaaatatc tgaccacaaa cgatgaccct 7 80 

cctcacacca aacccacaga gaacagaaac agcagcagag acaaatgcac ctccaaaaag 840 

aagtcccaca cacagtcgca gtcacaacac ttacaagcca aaccaacaac tttatctctt 900 

cctctgaccc cagagtcacc aaatgacccc aagggttccc catttgagaa caagactatt 960 

gaacgcacct taagtgtgga actctctgga actgcaggcc taactccacc caccactcct 102 0 

cctcataaag ccaaccaaga taaccctttt agggcttctc caaagctgaa gtcctcttgc 1080 

aagactgtgg tgccaccacc atcaaagaag cccaggtaca gtgagtcttc tggtacacaa 1140 

ggcaataact ccaccaagaa agggccggag caatccgagt tgtatgcaca actcagcaag 12 00 

tcctcagtcc tcactggtgg acacgaggaa aggaagacca agcggcccag tctgcggctg 12 60 

tttggtgacc atgactattg ccagtcaatt aattccaaaa cagaaatact cattaatata 132 0 

tcacaggagc tccaagactc tagacaacta gaaaataaag atgtctcctc tgattggcag 13 80 

gggcagattt gttcttccac agattcagac cagtgctacc tgagagagac tttggaggca 1440 

agcaagcagg tctctccttg cagcacaaga aaacagctcc aagaccagga aatccgagcc 1500 

gagctgaaca agcacttcgg tcatcccagt caagctgttt ttgacgacga agcagacaag 1560 

accggtgaac tgagggacag tgatttcagt aatgaacaat tctccaaact acctatgttt 162 0 



ataaattcag gactagccat ggatggcctg 
ctgagctacc cttgggatgg cacgcaatcc 
tcttctttta actctccatg tagagattct 
agaccccaaa ggatgcgctc tcgttcaagg 
tcaccatatt ccaggtcaag atcaaggtct 
tattacfcatg agtcaagcca ctacagacac 
agatcacgtt caagatcgcc ctacagccgt 
cagcacgaga ggctgaagag ggaagaatat 
agggccaagc aaagggagag gcagaggcag 
gtcggtaaaa tcagacctga cacaacacgg 
ggtgaaattg aggagtgcac agtaaatctg 
acctaccgtt atacctgtga tgcttttgct 
tcaaacgaaa ctgactttga gctgtacttt 
tatgcagacc tagattcaaa ctcagatgac 

gactctctgg attttgatag tttactgaaa 
gttccctagc tgaggatgac agagggatgg 
ctaaagacta ttgcaagtca tacttaggaa 
acaaaacaaa acaacaacaa tacaacaaga 
atgaacacag ctgctgaaga ggcaagagac 
catgggtgtc agctttgctt ttcctggagt 
gtatgtgtgt gtgtatgtat gtgtgtggtg 
ggtacatgtg aggactgggg gcacctgacc 
gcagcagttc catgaagaca cgcttaaaac 
aaaaggaaat atatatatat atatatatat 
agaaaactaa caaccaacca accaaccaac 
ctgatgtctg ggcatcagcc tttgtactct 
aagcaagctt tctctcataa cgtaatgatt 
catagaacta atatcctgtc tctctctctc 
ccttttgcca tggaatctgg gtgggagagg 
ctaacatttt gaagtttctg tagttcatcc 
atgttgatct tccactgcaa atttcaaaag 
cagcggttct ttctgaggag cggacaccgg 
tctctgagat gtgttcagat agtgtaattg 



tttgatgaca gcgaagatga aagtgataaa 1680 

tattcattgt tcaatgtgtc tccttcttgt 1740 

gtgtcaccac ccaaatcctt attttctcaa 1800 

tccttttctc gacacaggtc gtgttcccga 1860 

ccaggcagta gatcctcttc aagatcctgc 1920 

cgcacgcacc gaaattctcc cttgtatgtg 19 80 

cggcccaggt atgacagcta cgaggaatat 2 040 

cgcagagagt atgagaagcg agagtctgag 2100 

aaggcaattg aagagcgccg tgtgatttat 2160 

acagaactga gggaccgttt tgaagttttt 2220 

cgggatgatg gagacagcta tggtttcatt 22 80 

gctcttgaaa atggatacac tttgcgcagg 2340 

tgtggacgca agcaattttt caagtctaac 2 400 

tttgaccctg cttccaccaa gagcaagtat 2460 

gaagctcaga gaagcttgcg caggtaacat 2 52 0 

cgaatacctc atgggacagc gcgtccttcc 2 5 80 

tttctcctac tttacactct ctgtacaaaa 2 640 

acaacaacaa caataacaac aatggtttac 2 7 00 

agaatgatat ccagtaagca catgtttatt 2 7 60 

ctcttggtga tggagtgtgc gtgtgtgcat 2 820 

tgtgtgcttg gtttagggga agtatgtgtg 2 8 80 

agaatgcgca agggcaaacc atttcaaatg 2940 

ctagaacttc aaaatgttcg tattctattc 3 0 00 

atatatatat atatataaat taaaaaggaa 3 0 60 

caaccacaaa ccaccctaaa atgacagccg 312 0 

gtttttttaa gaaagtgcag aatcaacttg 3180 

atatgacaat cctgaagaaa ccacaggttc 32 40 

tctctctctc tctctttttt ttttcttttt 3300 

atactgcggg caccagaatg ctaaagtttc 3 3 60 

ttaatcctga cacccatgta aatgtccaaa 3420 

ccttgtcaat ggtcaagcgt gcagcttgtt 3 480 

gttacattac taatgagagt tgggtagaac 3 540 

ctacattctc tgatgtagtt aagtatttac 3 600 
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agatgttaaa 
tacagctatg 
gaatattcaa 
gaacttactg 
agtgcaacaa 
aagcatcttg 
ttccacattt 
aatgaaattt 
caatgggtaa 
aagtcaaaaa 
aaaatgctga 
tatctttgaa 
tttttacagt 
acatatttat 
aagccagtgg 
ttcatttggg 
tgacgtgacc 
agaaaaacct 
ttcagtaaca 
tgacatgact 
tcgcccaaca 
agctgcattc 
ggatgcttgc 
tatgaaatga 
gcctaattct 
taaagtttac 
aagactatta 
tgttcaagga 
tacattctcc 



cttatcaaaa 
agcctgacaa 
agaattagtg 
ctgtgctcct 
gcaccccatc 
tgggccattg 
tgctttagta 
gtgcaacaaa 
tcaaagcagc 
aaggaaacct 
tcttgatggc 
ctagaatagc 
ttcaataatg 
atactgttgg 
gccttttttc 
ttcctgtgat 
acgaatgttt 



tggagtattt ttattttatg tatatactat 
cactgtaaat gcagccttct tttcaaaact 
atgtaattat gaggtgaaac aattattgta 
cttatatata tttgattgta aaaacaaaaa 
gagcaaaatg atgctttccg cacatccatc 
tcaagaaata tcctagtccc ctaaaggtat 
tcttgtcgct tgtttttctt tgaagtttta 
tctcatctaa aatttttcta gaagatatca 
ccattaagaa atgtttttat tttctctatc 
tctttttaaa atgctgtttt gttttaattt 
ggtaataatt atagtatgat ttttacaata 
gccagtattt ctttcccttg gcagagtatg 
tatgcatcct gtataaatac tgatatttca 
cagttgcaga tagcctattt attataaatt 
aaattttcta cctaggatgc atgacaattg 
aaattcagct tttgcagaag cagtgtttct 
atggtgttgt tcttgatgac attgcttctg 
ccattttgat catcaggatt tcatctgagt 
tttggagtgt gtattcaagt ttctaaattg 
tttctggaag acatgataca cctactactc 
cgatcttgta agatggattt cacccccagg 
atttatcacc agcatattgt gttctgagtg 
ttgatttttt ggcttcttat ttctaagtag 
aagaacttgt tcacaggttc tgcgttacaa 
tgttgttctg taggttaaat gcaggtattt 
agtctttctt tctgaatttt gagtatcttc 
agagcaataa attattttta agaaatcgag 
ccacatgtgt tctctatttt gcctttaaat 
tttttgccct ggattgttga catgagtgga 
gacagcacta cagatatcat attgaggatt 
atattgttac catgaagata gttttcctca 
gagcttttgt atcttctgca gacactgtgg 
ctcattttta tttttatttt tttgggagag 
atcactggag gcaaatttca gcatagatct 
ccttcatgcc gtggtaagta ccacatctac 
atgtggattt ttttcttttt taaaagagat 
atcaattttt tgctaaacga ctccgagaac 
agagagggaa ctttgcacta ttggggtatg 
tttcatgcct ttagatgtga gcttccagta 
tgtaatgaga acttcaatca ctgtagtcta 
catgtactat aatgtgatga ttctaaattt 
tgaactgctg atttgatgga gctactttaa 
ttgaactatg ctgaagaggg aaagtgagcg 
cacctgccaa ttctacatgt attgttgtgg 
tttttttaaa tgtgcagtac acatcagcct 
caaatct 6317 



acaacaatgt 
gctaaatttt 
cactaacata 
gacagtgtgt 
ccttaggtga 
taaccacttc 
tacactggat 
tgattttatg 
aacagtagtt 
ttgtgatttt 
attaatgtgt 
acgatggtat 
ttcctttgtt 
atgagatgat 
tcaggttgga 
acttgcacta 
ctaaatttaa 
gtggagtccc 
agattcgatt 
aattgttctt 
ccaatgcagc 
aatccactgt 
atagaaagca 
cagtaacaca 
taactgtgtg 
tgttgtagaa 
atttagtaaa 
ttttgtgaac 
atacttggtt 
aatttatccc 
atggacttca 
gtagcccatc 
aatatttcaa 
gtaggatttt 
aattttggta 
gtagcagaat 
aacagttggg 
atgtttgggt 
ggtaatgatt 
agacctgatc 
gtacctatgt 
gatttgtagg 
attagttgag 
ttttattcat 
cactgagcta 



tcttttttgt 
tcttaatcaa 
tttagaagct 
gtgtctgttg 
gcttcaatct 
tgcgatattt 
ttgttagggg 
taaagtctct 
ttgaaactag 
aatttgatac 
gtctgaagac 
ttatctgtat 
tactaaagag 
gaaaataata 
gtgtaagtgc 
gcatggcctc 
taaaaacttc 
tggaatggaa 
actgtttggc 
ttcctttctc 
taattttgat 
ttgtcctgtc 
ataaaaatac 
tctttaatcc 
aacgccaaac 
taataataaa 
tcctattatg 
caattttaaa 
tcttttctta 
ccctaccccc 
aattgcatct 
aaaatgtaag 
atgaacacgt 
tagaagaccg 
accgaactgg 
aattcttcca 
ctgtcaacat 
cagttgataa 
atgtgtcctt 
tatagatgac 
gacagacatt 
tgaaagtgta 
cccttgccgg 
tgtatgaaaa 
ataaagggaa 



3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
5400 
5460 
5520 
5580 
5640 
5700 
5760 
5820 
5880 
5940 
6000 
6060 
6120 
6180 
6240 
6300 



<210> 232 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_013261 



ctgtagtcta agacctgatc tatagatacc tagaatagcc atgtactata atgtgatgat 

<210> 233 
<211> 3237 
<212> DNA 
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<213> Homo sapiens 



<300> 

<308> NM_013277 



<400> 233 

gcgaagtgaa 

ctaaataata 

ggaaaattac 

ttactgaaga 

tgaatgtgcg 

atgaagtcca 

ggactgacca 

gtgctctgga 

gacagagagc 

tgctcatgtg 

cttttctcaa 

ttgatgaatc 

attgggactc 

ctagccgaca 

ctgcagtaga 

atggcgggcc 

gaaggaaaac 

tggagccaag 

tgcatgactt 

ggataaaatt 



gggtggccca 

catgggggga 

aactattttc 

ttatactact 

gaatctgttt 

atttatccag 

tgagctgggg 

tgttaagctg 

tgaggctgac 

tgacacatct 

cagaggccaa 

tggttccatt 

ttctttggtg 

gtttgttgat 

ccaggggaat 

catcgaagct 

aggtacttta 

aactgagaca 

tgtttctaag 

tggcaaatta 



ggtggggcca 

aataaacaag 

aattgacgtt 

ggtagataga 

gagcagcttg 

ttggcgaagg 

aaatacaagg 

aagcatgcac 

tgcgaaaagc 

ggcagcattc 

ccatccagca 

ttatcagata 

aagactttca 

ggtccccctg 

gaatccatag 

gtgtccacta 

caaccttgga 

gacagtgtgg 

acggttatta 

tctctgaagt 



ggctgactga 
tattcatgag 
gaataggatg 
agagctaaag 
tgcgccgggt 
actttgagga 
atcttttgat 
gtaatcaggt 
tggaacgaca 
aactaagcga 
gcaatgctgg 
tcagctttga 
aactgaagaa 
gacctgtaaa 
ttgcaaaaac 
ttgagactgt 
acagtgactc 
gcacgccaca 
aacctgaatc 
gtcgagactg 



atgtatctcc 

ggtgaaaatg 

agtcatggaa 

aaagatggat 

ggagattctc 

tttccgtaaa 

gaaagcagag 

ggatgtagag 

gattcagctg 

ggagcaaaaa 

gaacaaaaga 

caagactgat 

gagagaaaag 

gaaaactcgt 

tacagtgact 

gccatattgg 

caccctgaac 

gagtaatgga 

ctgtgttcca 

tcgtgtggtc 



tagctatgga 

tgacccagca 

tttaagtgat 

actatgatgc 

agtgaaggaa 

aagtggcaga 

actgagcgaa 

atcaaacgga 

attcgagaga 

tcagctctgg 

ctatcaacca 

gaatcactgg 

aggcgctcta 

tccattggct 

gttcccaatg 

accaggagcc 

agcaggcagc 

gggatgcgcc 

tgtggaaagc 

tctcatccag 



aatgtcggga 
ttggagaggg 
ttgtgcattg 
tctctggctg 
tacccctcct 
ttcgaaacct 
cagaaatcac 
cccaggccaa 
gtccacatac 
cccatgctgt 
aggtggttga 
agcaagagaa 
agacaccaga 
tcaagactcc 
ctcctagatt 
ttgcttctcc 
tataagaaag 
gcattctcca 
tatctcccct 
ggagctttta 
ttctttttgg 
tgatctttct 
tatctgataa 
tagaccagag 
aactgagcta 
tgggagtatg 
ccttgtaaat 
ggttcatttg 



ccgctgtccc 
aatgctggca 
tgtaaatgag 
tgaccgcaca 
cagcaaagtg 
caaagaacct 
agatgaagac 
cagggacaca 
taaaatggat 
gcccaatcca 
gcgcctgctt 
cattgacccc 
tattaaagtg 
ttcatctagt 
tgggagcaaa 
aatgctcaag 
gacacatctg 
ggcttttact 
tccttctcct 
gatgagacat 
ggaagggggt 
ctagtttcat 
gattagcttg 
gatttaggat 
taatatggag 
cgtagctttg 
gtgaaagtac 
gggtcaaagc 



cttccctgca 
gactttgtgt 
attgagcaaa 
gtaaaagagc 
gatgatatcc 
cttctgacct 
aacagcatag 
ttagctttcc 
gttgccaatc 
gacccagtga 
tccttgcctc 
ctacatgtca 
agtttactgg 
tccctgtcac 
agcaagtctg 
tgaagtcaca 
tactctgctc 
caagtttaat 
caagtcacat 
ctttccaggg 
tattgttcct 
ttagaactga 
tctcagggtg 
gcctccttct 
ctttcataaa 
atttggatga 
aactcgtatt 
cagtttttct 



ttcctaccct 
cccagacttc 
gaggtctgac 
tgaaagagaa 
atgctatctg 
ttcgccttaa 
ctgccatgta 
tcatgattca 
tggctaaagt 
caatgttaca 
tggagtattg 
ttgaaaactc 
gacctgtgac 
agagagtccg 
ccactaacct 
tctgcctgtt 
tgcagcctcc 
tgtgcatgag 
aatatcagca 
gtagaagggt 
ttggcttaaa 
tttccgtgag 
ggaagtggga 
aagaaccaga 
aatgggatgc 
ttaggtcttt 
tatctctgat 
tttaaaattg 



gataggaaca 
tccaatgatc 
tgagacaggc 
attcctcaga 
tagccttcta 
cagagccttt 
ccaagctgtt 
cttgcagaga 
ctttggccct 
ggacatcaag 
gagtcagttc 
aaatgccttt 
cactcctgaa 
ttccaccctc 
aggacgacaa 
acttcccagc 
tgtactcatt 
ggttttatta 
ctttgtgctg 
tagtatggaa 
gccaaatgct 
acaatgacag 
gggcagggca 
agttctcatt 
attgaggaca 
aatagtgttg 
gtgccgctgg 
aattcattct 



cctgtcaaga 
ccctccattg 
ctgtatagga 
gtgaaaactg 
aaagactttc 
atggaagcag 
ggtgaactgc 
gtggctcaga 
acaatagtgg 
cgtcaaccca 
atgatggtgg 
tcaacaccac 
catcagcttc 
accaagaaca 
ggcaactttt 
attgactgac 
actactttta 
aaactatata 
gtcattgttg 
ttggttgtga 
gctcatagaa 
aaaccctacc 
aagaaaggat 
ccccattatg 
gaactagtga 
agtggcacaa 
ctgaactttg 
gatgcttggc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 



ccccataccc 
tggcatccca 
aagaaaagct 
atagccacat 
aaaaacgatt 



ccaaccttgt 
actaacaata 
acagtcattt 
ctagatagat 
ttcttctgta 



ccagtggagc 
aagagtaggc 
tttctttgca 
gtgagctttt 
gaatgtttga 



ccaacttcta 
tataagggaa 
ctttggatgc 
tcttctgtta 
cttcgtattg 



aaggtcaata 
gattgtcaat 
tgaaattttt 
aaattattct 
acccttatct 



tatcatcctt 
attttgtggt 
cccatggaac 
taatgtctgt 
gtaaaacacc 



2940 
3000 
3060 
3120 
3180 
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tatttgggat aatatttgga aaaaaagtaa atagcttttt caaaatgaaa aaaaaaa 3237 

<210> 234 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NMJ)132 77 



<400> 234 

ctcattcccc attatgaact gagctataat atggagcttt cataaaaatg ggatgcattg 6 0 

<210> 235 

<211> 1122 

<212> DNA 

<213> Homo sapiens 



<300> 

<308> NM_013409 



<400> 235 

gctcctcgcc 

tgcctcctgc 

tgctggctcc 

aaggaggagt 

gacaacacac 

aaagaaacgt 

aacaaacccc 

tgcgggctgg 

gagcagccag 

ttctgtccag 

tgtaatcgga 

gtcacctact 

ggattagcct 

ggtgggaaaa 

gagctgtgcc 

gccagcgagt 

cactccggat 

caggactaca 

gtgttcacat 



ccgcgcctgc 
tgctgctgct 
gtcaagcgaa 
gctgcagcac 
tcttcaagtg 
gtgagaacgt 
gctgcgtctg 
atgggaaaac 
aactggaagt 
gcagctccac 
tttgcccaga 
ccagtgcctg 
atgagggaaa 
aatgtttatg 
ctgacagtaa 
gtgccatgaa 
cttgcaactc 
gctttcctat 
agcctttgtg 



ccccaggatg 
ctgccagttc 
gaacggccgc 
cggccggctg 
gatgattttc 
ggactgtgga 
cgccccggat 
ctaccgcaat 
ccagtaccaa 
atgtgtggtg 
gcctgcttcc 
ccacctgaga 
gtgtatcaaa 
ggatttcaag 
gtcggatgag 
ggaagctgcc 
catttcggaa 
atcttotatt 
caaaaaaaaa 



gtccgcgcga 
atggaggacc 
tgccaggtcc 
agcacctcgt 
aacgggggcg 
cctgggaaaa 
tgttccaaca 
gaatgtgcac 
ggcagatgta 
gaccagacca 
tctgagcaat 
aaggctacct 
gcaaagtcct 
gttgggagag 
cctgtctgtg 
tgctcctcag 
gacaccgagg 
ctagagtggt 
aaaaaaaaaa 



ggcaccagcc 
gcagtgccca 
tgtacaagac 
ggaccgagga 
cccccaactg 
aatgccgaat 
tcacctggaa 
tcctaaaggc 
aaaagacttg 
ataatgccta 
atctctgtgg 
gcctgctggg 
gtgaagatat 
gccggtgttc 
ccagtgacaa 
gtgtgctact 
aagaggagga 
aaactctcta 
aa 1122 



gggtgggctt 
ggctgggaac 
cgaactgagc 
ggacgtgaat 
catcccctgt 
gaacaagaag 
gggtccagtc 
aagatgtaaa 
tcgggatgtt 
ctgtgtgacc 
gaatgatgga 
cagatctatt 
ccagtgcact 
cctctgtgat 
tgccacttat 
ggaagtaaag 
agatgaagac 
taagtgttca 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 



<210> 236 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_013409 



<400> 236 

gaagatgaag accaggacta cagctttcct atatcttcta ttctagagtg gtaaactchc 60 

<210> 237 

<211> 11389 

<212> DNA 

<213> Homo sapiens 



<300> 

<308> NM_J)14246 
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<400> 237 
atggcgccgc 
ctgccggcga 
gccttcgccc 
ccgcgggagc 
ggcgcggggc 
ctgagccgcc 
tgcggaaccg 
gccgcgcagc 
cgccccaggc 
ctgcgtctgc 
gaggccgcca 
aacttgcccg 
gggagcctga 
ggcaccctca 
agctattaca 
gccacgggcg 
ctcagggtga 
gtcttggtca 
cgcgtgcggg 
gactcgccca 
cagctcaacg 
gccgagtacc 
gccacggcca 
gagcagaact 
gtgcaggcca 
gggaacgtgg 
cccttggatt 
cggcccccgc 
aacgagccta 
ggctaccccg 
cactatcgcc 
aatcctgccc 
gtgtgtgccg 
gaccacggct 
aatgacaacg 
gccgtgggga 
acctaccagc 
ggcggcctca 
gcggtgacag 
gatgccaaca 
gacaggcctg 
aatgcccgca 
agtggcacca 
accatcatgg 
ctcatcctcg 
atctttgagg 
tcaggtccca 
ttctacatcg 
gtggccgtgt 
gcctcggtag 
aaggacgaac 
attcgtgcta 
ggggacatgc 
gagctggact 
ctggtgagcc 
ctgcccgact 
accggcgtga 
accttcgtgc 
cagctcagcc 



cgccgccgcc 
tggggctgcg 
tccggcccgg 
tgctggacgt 
gcccgctgcc 
gcctgcgggc 
gtgcccggct 
attcggcgct 
cccgctgtcc 
tgtgcgccct 
ccgcggggac 
aagcccgggc 
agtttccgat 
tcctccagct 
tggaggggct 
ccgtgagcac 
aagccgtgga 
aagacaccaa 
agaacctgga 
tcaacgccaa 
agagctctgg 
agctcctggt 
ccgtgtacat 
acgtggtcca 
cggaccggga 
ccggccagtt 
tcgaggatgt 
tcatcaattc 
tctfctgtgag 
tggtgcacat 
tggtggacac 
ccacccctga 
agctggaccg 
cgccccccat 
acccggtgtt 
gcagcgtgct 
tcacaggcgg 
tcaccctggc 
catccgacgg 
cccacaggcc 
tgggcacctc 
tcacctacgt 
tgtacaccab 
cccaggacaa 
atgccaatga 
atgctccacc 
atgggcgtct 
agcccacgtc 
acaacctttg 
aaatccaggt 
tggagctgtt 
acgaccctga 
ggcatttctt 
ttgaggtccg 
gagccacggt 
tccagatcct 
tcggctgcat 
agggcaacga 
gcgacctgga 



cgtgctgccc 
agcggccgcc 
ctgtacctac 
gggccgcgat 
gctgcaagtc 
gcgcacgcac 
ctgcggggcg 
cgcagctccg 
cggccgtccc 
gcggcgcgcg 
gccctccgcg 
ggggccggcg 
gcccaactac 
gcacgcgcac 
gttcgacgag 
ggacagcgta 
ctacagtacg 
cgaccacagc 
ggtgggctac 
cttgcgttac 
cgtggtgagc 
ggaggccaac 
cgaggtggag 
ggtgcccgag 
ccagggccag 
ctacctgcac 
ccagaaatac 
ttcaggggtg 
cagccccttc 
tcaggcggtg 
ggcctccacc 
cttccccttc 
cgaggaggtg 
gagctcctcc 
cacgcagccc 
gaccctgcag 
caacacccgg 
gctacctctg 
cacacggtcg 
tgtctttcag 
cattgctacc 
gattcaggac 
gatggagctg 
cggcatcccg 
caatgcaccc 
ctcgaccagc 
gctgtacacc 
cggtgtgatt 
ggctctggct 
gaccatcttg 
tgttgaggag 
tgaaggccct 
ccagctggac 
gcgggagtat 
gcacatcctt 
cttcaacaac 
cccggcccat 
gctgcgcctg 
caacaaccgg 



gtgctgctgc 
tgggagccgc 
gcggtgggcg 
gggcggctgg 
cgcttggtgg 
ctfccccggct 
ctctgcttcc 
accaccttac 
atctgcctgc 
gctggcgccg 
tcgccatccc 
cgacgggccc 
caggtggcgt 
tacaccatcg 
cgctcccggg 
ctggaccgcg 
ccgccgcgct 
ccggtcttcg 
gaggtgctga 
cgcgtgttgg 
acacgggcgg 
gaccaggggc 
gacgagaacg 
gacgtggggc 
aacgcggcca 
tcgctgagcg 
tcgctgagca 
gtgtctgtgc 
caggccacgg 
gacgcggact 
tttctggggg 
cagatccaca 
gagcactaca 
accagcgtgt 
acctacgagc 
gcccgcgacc 
aaccgctttg 
gactacaagc 
cacactgcgc 
agctcccatt 
ctcagtgcca 
cccgtgccgc 
gactatgaga 
cagaaatcag 
cagttcctgt 
atcctccagg 
ttccagggtg 
cgcacccagc 
gtggatcggg 
gacattaatg 
aacaacccag 
aatgcccaga 
ctgctcaacg 
gtgctggtgg 
ctcgtggacc 
tatgtcacca 
gaccccgacg 
ttgctgctgg 
ccgctggagg 



tcctggccgc 
gcgtacccgg 
ccgcttgcac 
caggacgtcg 
cccgcagtgc 
gcggagcccg 
ccgtccccgg 
ccgcctgccg 
cgccgggcgg 
tccgggtggg 
catcgccgcc 
ggcggggcac 
tgtttgagaa 
agggcgagga 
gctacttccg 
agaccaagga 
cggccaccac 
agcagtcgga 
ccatccgcgc 
ggggcgcgtg 
tgctggaccg 
gcaatccggg 
acaactaccc 
tcaacacggc 
ttcactacag 
ggatcctgga 
ttaaggccca 
aggtgctgga 
tgctggagaa 
ctggagagaa 
gcggcagcgc 
acagctccgg 
gcttcggggt 
ccatcacggt 
ttcgtctgaa 
gtgacgccaa 
cactcagcag 
aggagcagca 
atgtcctaat 
acacagtgag 
acgatgagga 
agttccgcat 
accaggtcgc 
acaccaccac 
gggatttcta 
tctctgccac 
gggacgacgg 
gccggctgga 
gcagtcccac 
acaatgcccc 
tggggtcggt 
tcatgtatca 
gggacctgcg 
tgcaggccac 
agaatgacaa 
acaagtccaa 
tgtcagacag 
accccgccac 
cgctcatgga 



cgccgccgcc 
cgggacccgc 
gccccgggcg 
gcgcgtctcg 
cccgacggcg 
tgcccggctc 
cggctgcgcg 
ctgcccgccg 
ctcggtccgc 
actggcgctg 
cctgccgccg 
gagcggcaga 
cgaaccggcg 
ggagcgcgtg 
aatcgactct 
gacgcacgtc 
ctacatcact 
gtaccgcgag 
cagcgaccgc 
ggacgtcttc 
ggaggaggcg 
cccgctcagt 
ccagttcagc 
tgtgctgcga 
catcctcagc 
tgtgatcaac 
ggatgggggc 
tgtcaacgac 
tgtgcccctg 
cgcccggctg 
tgggcctaag 
ttggatcaca 
ggaggcggtg 
gctggacgtg 
tgaggatgcg 
cagtgtgatt 
ccagagaggg 
gtacgtgctg 
caacgtcact 
tgtcagtgag 
cacaggagag 
tgaccccgac 
ctacacgctg 
cctagagatc 
ccagggttcc 
ggaccgggac 
cgatggggac 
ccgggagaat 
tccccttagc 
catgtttgag 
ggtggcaaag 
gattgtggaa 
tgccatggtg 
gtcggctccg 
cccgcctgtg 
cagtttcccc 
cctcaactac 
gggcgaactg 
ggtgtctgtg 
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^aataaca tccacagcgt cacggccttc tgcaccctgc gtgtcaccat catcacggac 
o^aScSa ccaacaSt cactgtccgc ctggagaaca tgtcccagga gaagttcctg 
?ccccgctgc tggcccStt cgtggagggg gtggccgccg tgctgtccac caccaaggac 
™2*StS tStcaacgt ccagaacgac accgacgtca gctccaacat cctgaacgtg 
IrStctcS cgctgctg^c tggcggcgtc cgcggccagt tcttcccgtc ggaggacctg 
accttctcgg cgctg«gcc ^aaaeacta ctgaccacca tctccacgca gcgcgtgctg 
cSSS acScftS SSSSS cc^gcgaga actacatgaa £gcgtgtcc 
SSgcgat tcgacagctc cgcgcccttc ctcagctcca ccaccgtgct ^ttccggccc 
ItccaccSca tcaacggcct gcgctgccgc tgcccgcccg gcttcaccgg cgactactgc 
^ratlll tcaacctctg ctactccgac ccgtgcggcg ccaacggccg ctgccgcagc 
SSagggcg gc?acacSg cgagtgc?tc gaggacttca ctggagagca ctgtgaggtg 
lltalcllct caggccgctg tgccaacggg gtgtgcaaga acgggggcac ctgcgtgaac 
SSca?cg gcggct?cca ctgcgtgtgt cctoctggcg agtatgagag sccctactgt 
ctgcucatcy _ f^nt-i-rrThra ccttccaqag cctgagacag 

3ag ss« :=s:?c" ac«=s« ...^5? .*,£=,*«= 
S3 is =5= 2= 3=22 SM 

Sa =5=2 =s5s ssss g= 1 

gJttgtgaca caaccatggc tgtgcgcttt ggaaaggaca tcgggaacta cagctgcgct 

KKS S2SS =1 = I SS 

PiPiiisiiii^ 

So&iS agga„g=gt tc„,«g.g j-o-jtj g= .= c,gct«=g= 
S«g"g= tgSSgtt gcggggacc gacggggagt ggc.ccacct gctgatcgag 

SBBBiHi 

SEES SEES SSS5 ssss: sss= 

ssss ssssss ssss ™s 

^JttaS Saccaccat ctccttcgtg gacctcaggg ccatgaatga gaacrctgagc 
cgcaatSga Saggtgga cggcgccagg a-ctgcagc tggtgagggc ? ctgcgcagt 
nrt-acacaac acacgggcac gctctttggc aatgacgtgc gcacggccta ccagctgccg 
ggccacgtcc ttcaJScga Sagctggcag cagggcttcg -ctggcagc -cgcaggac 
gccgac? t tc acgaggacgt catccactcg ggcagcgccc tcctggcccc agccacc gg 

SEES SSSE S=K = SSSSS = S£ < s 
SS c ~ga SSSS SSSS SSEK 

^°^Kcc c^accgactt cttcagacca cctgaagaaa aagaaggccc cctgctgagg 

Ebs SEES ssss ssssss ss 

S SSES SSSSS SSSS 

agSggggg c?c°gctoc? g.gacccctg gagaggcccg tcctggtgga g«cg=cctg 
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ctggaggtgg 
ggtgggacgg 
gtcgcctgcc 
gagaacgggg 
gcagccctgc 
cacagcattc 
gggatcaacc 
atctacatga 
accgaggtgc 
atcccggcca 
gacttctgct 
gctgttataa 
aagcaccatt 
ctgctgctca 
agctttcact 
cactgcgtgc 
ctgcacctgg 
aacaccacct 
tcgctggaca 
gtgaggggca 
cctggccacg 
gcctcctcac 
ccggccaggg 
gccggctggc 

cgcctgaagg 
cgtggagagt 
cccccagagc 
ctgacggagc 
cccacatcct 
aagagccctg 
actgggagcg 
ccacacaggc 
tggagtgggc 
cacaaaggtc 
aggccacagg 
cagtcaaagc 
ggaacgccgg 
cagtgcggac 
aatcacttcg 
gaggtgcaac 
caatgatctc 
gttgacgaaa 
aaccaagcgg 
ctccccgccg 
atctgccccg 
ccctgctgag 
tctgtgacca 
gatggctttg 
tcctgatttg 
taactctctg 
tacttcattt 
caacggttac 
ggcacagcgt 
ttagcgtttc 
caaataggag 
gttttaaact 
aaagcgtatc 
cctgaatgaa 
agtcccttta 
aacagtctcc 



aggagcgaac 
gagggtggtc 
agtgcagcca 
aggtcctgcc 
tggtggcctt 
acaagcacct 
agacggaaaa 
gcacctttgc 
gcaacatcga 
ttgtcacagg 
ggctgtcgct 
tcatcaacac 
attatgggaa 
tcagcgccac 
acctcttcgc 
tcaaccagga 
aggactccgc 
tcggtgacgg 
gcatcgtcag 
gccacggaga 
attccgactc 
actcgtcaga 
gcgccgtcca 
ccgaccagag 

tggagaccaa 
accccccgga 
agaggaaagg 
agacgctgaa 
cgcgcacgtc 
ggagggagcc 
cccaggccga 
tgcggcatca 
ccagagtgtg 
ttggttctcc 
catgagggag 
cagaactgag 
gcacagacca 
ccgtggctgg 
ctatgtggga 
ctgtatatat 
ttgctgtctt 
ctggtgcccc 
ctagaagccc 
gccccctggg 
gctgtgtgaa 
gtcactgtgg 
ccaccccctc 
tcctttgctc 
gacagaggtg 
agactgcgag 
tgcttcaaat 
caaactgttt 
ggcccgaggg 
ctttgttctt 
aaaccataaa 
tgaccctgat 
tgaaaatggt 
ttatacatgt 
aacgtttgcc 
tttcatgact 



caagcctgtc 
tgcccggggc 
cacagccagc 
tctgaagatt 
cgtcctcctg 
cgccgtggcg 
cccgtttctg 
ctggaccctc 
cacggggccc 
actggcggtc 
tcaagacacc 
agtcacttct 
aaaagggatc 
ctggctgctg 
catctbcagc 
ggtccggaag 
caccaccagg 
gcctgacatg 
ggatgaaggg 
gccagacgcg 
agatagcgag 
cagcgaggac 
cagcaccccc 
cctggctgag 

ggtcagcgtg 
ccaggagagc 
catcttgaaa 
gggccggctc 
ttccctgggc 
ggggcgtgac 
tggctccgac 
ccctcagacc 
gcggtcccca 
caggagctca 
gcgtggacca 
aggggacccc 
aagaccgcgg 
cagcccgggc 
aaggtggaga 
attgcattcg 
ctctgtcaag 
agcagatcaa 
tacagctgcc 
tccccagtgt 
ttcaagacag 
ggcacggttg 
ccctcaccgc 
ctgctccccg 
tggggccttc 
gatctccagg 
ggccaattgt 
ttggaaattc 
gctgtggaac 
caaatcaggt 
ataggatgaa 
gaaaatgtga 
ccaggacagg 
gccttaccag 
cctggcgttt 
ttaacaggat 



tgcgtgttct 
tgcgagctcc 
tttgcggtgc 
gtcacctatg 
agcctggtcc 
ctcttcctct 
tgcacagtgg 
gtggagagcc 
atgcggttct 
ggcctggacc 
ctgatttgga 
gtcctatctg 
gtctccctgc 
gggctgctgg 
ggcttacagg 
cacctgaagg 
gccaccctgc 
ctgcgcacag 
atccagaagc 
tccctcatgc 
ctgtccctgg 
gatggggtgg 
aaaggggacg 
agtgacagtg 

gagctgcacc 
gggggcgcag 
aataaagtca 
cgggagaagc 
tctggcggcc 
cacctcaacg 
tctgagaaac 
ttggagccca 
tggtggcagc 
gggcctgtca 
ctgggccagc 
agactgggcc 
tccagccccg 
agtcctttgc 
tacttttata 
tgctgacttt 
attgcacagt 
aggtgggaaa 
ttcggccagg 
tcgctgtgtg 
ggcagtgcag 
ccacacggct 
ctcccaggtg 
tgggacctgg 
caggccgtta 
cagggttctc 
gcagagggac 
acaccaaggt 
gggtcccgga 
gcccaaataa 
aatcaagtaa 
gcactgttag 
aggatgaaat 
gtgagtggtc 
cctaagtacg 
tctgccccct 



ggaaccactc 
tgtccaggaa 
tcatggatat 
ccgctgtgtc 
gcatgctgcg 
ctcagctggt 
ttgccatcct 
tgcatgtcta 
actacgtcgt 
cccagggcta 
gctttgcggg 
caaaggtttc 
tgaggaccgc 
ctgtgaaccg 
gccccttcgt 
gcgtgctcgg 
tgacgcgctc 
acttgggcga 
tcggcgtgtc 
ccaggagctg 
atgagcagag 
gagctgagga 
ctgtggccaa 
aggaccccag 

gcgaggagca 
ccaggcttgc 
cctacccgcc 
tggccgactg 
ccgactgcgc 
gggtggccat 
cgtgaggcaa 
aggggccact 
cccccgactg 
gacctggtga 
accgctgagt 
cagaggctgg 
cccaggcggg 
aaaggcaccc 
tatttgtatg 
gttatcccga 
tgtacttgaa 
tacgtcagca 
aagtgaggat 
tgcgtttgtc 
cactaggcag 
gtcatttttc 
gcccgggagc 
gaccttaaag 
catacctcct 
ccctctggag 
aaagccacag 
cgggcccact 
actgtcagac 
gtgatcagca 
aatgcaaaga 
cagatgccta 
gagatcccag 
tttcgaagat 
aaaaggtttt 
gaggtgtaat 



cctggccgtt 
ccggacacat 
ctccaggcgt 
cttgtcactg 
ctccaacctg 
gttcgtgatt 
cctccactac 
ccgcatgctg 
gggctggggc 
cgggaacccc 
gcccatcgga 
ctgccaaaga 
attcctcctg 
cgatgcactg 
cctccttttc 
cgggaggaag 
cctcaactgc 
gtccaccgcc 
ctctgggctg 
caaggatccc 
cagctcttac 
aaaatgggac 
ccacgttccg 
cggcaagccc 

gggcagtcac 
tagcagccag 
gccgctgacg 
tgagcagagc 
catcacagtc 
gaatgtgcgc 
gcccgtcacc 
gcccttgaag 
atcatccaga 
caagtgccaa 
cctaagactg 
ccagagttca 
catctcatgg 
cttgtcttaa 
ggactctgag 
gagatccatg 
tctggcatgt 
gtggggctaa 
ggtgtgggcc 
ctctgctgcc 
gtgtgaggag 
acctggtcat 
tgcaggtggg 
cgttgcaggt 
gccaattctc 
tctgaccaat 
ccacactctt 
gcaggcagct 
atgtttgatt 
cagctgcttc 
tgtccacact 
tgggagagga 
agtcctcaca 
aaaaaactct 
taagtcttcg 
ttttttgttc 
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tatttttttc 
actgttacca 
attacttaag 
attaatgttc 
gaatttgtac 
cagggtggcc 
tccctcagcc 
cctagatggg 
tcttcagttt 
agacgttaga 



cacgtactcc 
aaaaacaact 
actttatggg 
aaaacactgg 
ttctgcggtg 
gtggaaggga 
agtggaacgt 
cagtttttgt 
tcttttcttg 
cctcttgatg 



acagccaaca 
gtcagtttta 
agagattaga 
aattacaaat 
ctggtttttc 
cggtttacgg 
gccggaactt 
ctttcaccaa 
cactgatctt 
ttttcccact 



tcacgaggtg 
ttgagatggg 
cactggaggt 
gagaagagtc 
tccacaaaca 
acgtgcagct 
tttgtccatt 
atttgaggac 
tctcctctcc 
ggtccctgag 



taatttttaa 
aaaaatgtaa 
ttttaacaga 
tacaataaat 
cccccgcccc 
gagctgtccg 
ccctagtagg 
tttttttttt 
ttctgtgact 
gctctgttc 



tttgatcaga 
acctatttfct 
acgtgtattt 
taagattttt 
tccccatgcc 
tgtcccatgc 
cctgccacag 
tgccattatt 
ccagtgactc 
11389 
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<210> 238 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> N1XL_014246 



<400> 238 

gggagagatt agacactgga ggtttttaac agaacgtgta tttattaatg ttcaaaacac 



60 



<210> 239 

<211> 4372 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM__014314 



<400> 239 

tagttattaa 

cctgctttcc 

cctgcgggga 

gcctgcaagc 

acatggcccc 

agggcccaat 

gctggttccg 

ccattgaaag 

aacgtttaca 

ctgaatgttt 

tgatggcagg 

ccaaaacttt 

tagagaaagg 

acatacagat 

caccttcaga 

tagagcttgc 

gtggaaaaac 

gacaaaaggg 

ctgtattctc 

cagctgagaa 

cacagattct 

tgatgatatt 

ttaattatct 

tgactgcctc 

gcaagctgtg 

aactggagca 

gcgacaaatt 

gaatctgcaa 

aatatgaaca 

aagatgaaga 



agttcctatg 
ccgctcgcca 
acgtagctag 
cttccaggat 
ctggtttagg 
ggaggctgcc 
tggctttttg 
ttgggatttc 
accagaattt 
aattaatcag 
tgcagagaaa 
gaaacttgct 
tataaaagat 
tttctaccaa 
agtgtctgat 
tttgcctgct 
ctttgtttca 
gaaagttgtc 
aaaatacttt 
tgtcccagtg 
tgtgaacaac 
tgatgaatgc 
agatcagaaa 
ggttggtgtt 
tgcttctctt 
agttgtttat 
taaatacatc 
agacctcgaa 
atggattgtt 
gagcaggatt 



cagctccgcc 
cgccctcctc 
ctgcaagcag 
tatatccgga 
gaggaagagg 
acactttttc 
gatgccctag 
aaaaaaattg 
aaaaccagaa 
gaatgtgaag 
ttggtggaat 
ttggagaaag 
gttgaaacag 
gaagatccag 
acaaacttgt 
atgaaaggaa 
ctgcttatat 
ttttttgcga 
gaaagacatg 
gaacagattg 
cttaaaaagg 
cacaacacta 
cttggaggat 

ggggatgcca 

gatgcgtcag 
aagccccaga 
atagctcagc 
aacttatctc 
acagttcaga 
tgtaaagccc 



tcgcgtccgg 
ctacccggct 
aggccggcat 
agaccctgga 
tgcagtatat 
tcaagttcct 
accatgcagg 
aaaagttgga 
ttatcccaac 
aaattctaca 
gccttctcag 
aaaggaacaa 
aagatcttga 
aatgccagaa 
acagcccatt 
aaaacacaat 
gtgaacatca 
atcagatccc 
ggtatagagt 
ttgagaacaa 
gaacgattcc 
gtaaacaaca 
cttcaggccc 
aaaacacaga 
tgatagcaac 
agtttttcag 
tgatgaggga 
aaattcaaaa 
aagcatgcat 
tgtttttata 



cctcatttcc 
ttaaagctag 
gaccaccgag 
ccctacctac 
tcaggctgag 
gttggagctc 
ttattctgga 
ggagtataga 
cgatatcatt 
gatttgctct 
atcagacaag 
gttcagtgaa 
ggataagatg 
tcttagtgag 
taaaccaaga 
aatatgtgct 
tcttaaaaaa 
agtgtatgaa 
tacaggcatt 
tgacatcatc 
atcactatcc 
cccgtacaat 
actgccccag 
tgaagccttg 
agtcaaacac 
gaaagtggaa 
cacagagagt 
tagggaattt 
ggtgttccag 
cacttcacat 



tcggaaaatc 
tgaggcacag 
cagcgacgca 
atcctgagct 
aaaaacaaca 
caggaggaag 
ctttatgaag 
ttacttttaa 
tctgatctgt 
actaagggga 
gaaaactggc 
ctgtggattg 
gaaacttctg 
aattcatgtc 
aattaccaat 
cctacaggtt 
ttcccacaag 
cagcagaaat 
tctggagcaa 
attttaactc 
atctttactt 
atgatcatgt 
gtcattgggc 
gattatatct 
aatctggagg 
tcacggatta 
ctggcaaaga 
ggaacacaga 
atgccagaca 
ttgcggaaat 
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ataatgatgc cctcattatc agtgagcatg cacgaatgaa agatgctctg gattacttga 1860 
aagacttctt cagcaatgtc cgagcagcag gattcgatga gattgagcaa gatcttactc 1920 
agagatttga agaaaagctg caggaactag aaagtgtttc cagggatccc agcaatgaga x980 
atcctaaact tgaagacctc tgcttcatct tacaagaaga gtaccactta aacccagaga 
caataacaat tctctttgtg aaaaccagag cacttgtgga cgctttaaaa aattggattg 
aaggaaatcc taaactcagt tttctaaaac ctggcatatt gaatggacgt ggcaaaacaa 2160 
atcagaacac aggaatgacc ctcccggcac agaagtgtat attggatgca ttcaaagcca 2220 
qtggagatca caatattctg attgccacct cagttgctga tgaaggcatt gacattgcac 2280 
agtgcaatct tgtcatcctt tatgagtatg tgggcaatgt catcaaaatg atccaaacca 
gaggcagagg aagagcaaga ggtagcaagt gcttccttct gactagtaat gctggtgtaa 
ttgaaaaaga acaaataaac atgtacaaag aaaaaatgat gaatgactct attttacgcc 
ttcagacatg ggacgaagca gtatttaggg aaaagattct gcatatacag actcatgaaa 
aattcatcag agatagtcaa gaaaaaccaa aacctgtacc tgataaggaa aataaaaaac 
tgctctgcag aaagtgcaaa gccttggcat gttacacagc tgacgtaaga gtgatagagg 
aatgccatta cactgtgctt ggagatgctt ttaaggaatg ctttgtgagt- agaccacatc 
ccaagccaaa gcagttttca agttttgaaa aaagagcaaa gatattctgt ^cccgacaga 
actgcagcca tgactgggga atccatgtga agtacaagac atttgagatt ccagttatea 2820 
aaattgaaag ttttgtggtg gaggatattg caactggagt tcagacactg tactcgaagt 
ggaaggactt tcattttgag aagataccat ttgatccago agaaatgtcc aaatgatatc 
aggtcctcaa tcttcagcta cagggaatga gtaactttga gtggagaaga aacaaacata 
gtgggtataa tcatggatcg cttgtacccc tgtgaaaata tattttttaa aaatatcttt 
agcagtttgt actatattat atatgcaaag cacaaatgag tgaatcacag cactgagtat 
tttgtaggcc aacagagctc atagtacttg ggaaaaatta aaaagcctca tttctagcct 
tctttttaga gtcaactgcc aacaaacaca cagtaatcac tctgtacaca ctgggataga 
tgaatgaatg gaatgttggg aatttttatc tccctttgtc tccttaacct actgtaaact 
ggcttttgcc cttaacaatc tactgaaatt gttcttttga aggttaccag tgactctggt 
Sccaaatcc actgggcact tcttaacctt ctatttgacc tctgcgcatt tggccctgtt 
gagcactctt cttgaagctc tccctgggct tctctctctt ctagttctat tctagtcttt 
ttttattgag tcctcctctt tgctgatccc ttccaagggt tcaatatata tacatgtata 
tactgtacat atgtatatgt aactaatata catacataca ggtatgtata tgtaatggtt 
atatgtactc atgttcctgg tgtagcaacg tgtggtatgg ctacacagag aacatgagaa 3660 
cataaagcca tttttatgct tactactaaa agctgtccac tgtagagttg ctgtatgtag 3720 
caatgtgtat ccactctaca gtggtcagct tttagtagag agcataaaaa tgataaaata 3780 
cttc?tgaaa aottagttta ctatacatct tgccctatta atatgttctc ttaacgtgtg 3 840 
ccattgttct ctttgaccat tttcctataa tgatgttgat gttcaacacc tggactgaat 3900 
gtctgttctc agatcccttg gatgttacag atgaggcagt ctgactgtcc tttctacttg 
aaagattaga atatgtatcc aaatggcatt cacgtgtcac ttagcaaggt ttgctgatgc 
ttcaaagagc ttagtttgcg gtttcctgga cgtggaaaca agtatctgag ttccctggag 
atcaacggga tgaggtgtta cagctgcctc cctcttcatg caatctggtg ^agtggtg 
caggcgggga gccagagaaa cttgccagtt atataacttc tctttggctt ttcttcatct 
gtaaaacalg gataatactg aactgtaagg gttagtggag agtttttaat taaaagaatg 
tgtgaaaagt acatgacaca gtagttgctt gataatagtt actagtagta gtattcttac 
taagacccaa tacaaatgga ttatttaaac caaaaaaaaa aaaaaaaaaa aa 43/^ 

<210> 240 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<3 08> WM_014314 

<400> 240 ^x.,.^,^^^ cr. 

agttcagaca ctgtactcga agtggaagga ctttcatttt gagaagatac catttgatcc 60 

<210> 241 

<211> 1647 

<212> DNA 

<213> Homo sapiens 

<300> 



3960 
4020 
4080 
4140 
4200 
4260 
4320 
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<308> NM_014321 
<400> 241 

gcgcgcgggt ttcgttgacc cgcggcgttc acgggaattg ttcgctttag tgccggcgcc 60 

atggggtcgg agctgatcgg gcgcctagcc ccgcgcctgg gcctcgccga gcccgacatg 12 0 

ctgaggaaag cagaggagta cttgcgcctg tcccgggtga agtgtgtcgg cctctccgca 180 

cgcaccacgg agaccagcag tgcagtcatg tgcctggacc ttgcagcttc ctggatgaag 240 

tgccccttgg acagggctta tttaattaaa ctttctggtt tgaacaagga gacatatcag 300 

agctgtctta aatcttttga gtgtttactg ggcctgaatt caaatattgg aataagagac 3 60 

ctagctgtac agtttagctg tatagaagca gtgaacatgg cttcaaagat actaaaaagc 42 0 

tatgagtcca gtcttcccca gacacagcaa gtggatcttg acttatccag gccacttttc 480 

acttctgctg cactgctttc agcatgcaag attctaaagc tgaaagtgga taaaaacaaa 540 

atggtagcca catccggtgt aaaaaaagct atatttgatc gactgtgtaa acaactagag 600 

aagattggac agcaggtcga cagagaacct ggagatgtag ctactccacc acggaagaga 660 

aagaagatag tggttgaagc cccagcaaag gaaatggaga aggtagagga gatgccacat 72 0 

aaaccacaga aagatgaaga tctgacacag gattatgaag aatggaaaag aaaaattttg 7 80 

gaaaatgctg ccagtgctca aaaggctaca gcagagtgat tt cage t tec aaactggtat 840 

acattccaaa ctgatagtac attgecatet ccaggaagac ttgaeggett tgggattttg 9 00 

tttaaacttt tataataagg atcctaagac tgttgccttt aaatagcaaa gcagcctacc 960 

tggaggctaa gtctgggcag tgggctggcc cctggtgtga gcattagacc agecacagtg 1020 

cctgattggt atagecttat gtgetttect acaaaatgga attggaggee gggcgcagtg 1080 

gctcacgcct gtaatcccag cactttggga ggccaaggtg ggtggatcac ctgaggtcag 1140 

gagctcgaga ccagcctggc caacatggtg aaaccccatc tctactaaaa atacaaaaat 12 00 

tagecaggtg tgatggtgca tgcctgtaat cccagctcct cagtaggctg agacaggagc 12 6 0 

atcacttgaa cgtgggaggc agaggttgea gtgagecgag attgcaccac cgcactccag 132 0 

cctgggtgac agagegagae ttatctcata aataaataga tagatactcc agcctgggtg 1380 

acagagegag acttatagat agatagatag atagatggat agatagatag atagatagat 1440 

agatagataa aeggaattgg agecattttg ctttaagtga atggcagtcc cttgtcttat 1500 

tcagaatata aaattcagtc tgaatggcat cttacagatt ttacttcaat ttttgtgtac 156 0 

ggtatttttt atttgactaa atcaatatat tgtacagect aagttaataa atgttattta 162 0 
tatatgeaaa aaaaaaaaaa aaaaaaa 1647 

<210> 242 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_014321 
<400> 242 

tgctttaagt gaatggcagt cccttgtctt attcagaata taaaattcag tctgaatggc 60 

<210> 243 

<211> 1455 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NMJD14364 
<400> 243 

ggcggtccgc acgcacctcg gtaacatcac agcaggtcca ggecaatgat aaccttataa 60 

gaggecatgt egaagegega catcgtcctc accaatgtca ccgttgtcca gttgetgega 12 0 

cagccgtgcc cggtgaccag agcaccgccc ccacctgagc ctaaggctga agtagagece 180 

cagccacaac cagagcccac accagtcagg gaggaaataa agccaccacc gccaccactg 240 
cctcctcacc ccgctactcc tcctcctaag atggtgtctg tggcccggga gctgactgtg 3 00 

ggcatcaatg gatttggacg catcggtcgc ctggtcctgc gcgcctgcat ggagaagggt 3 60 

gttaaggtgg tggctgtgaa tgatccattc attgaccegg aatacatggt gtacatgttt 420 
aagtatgact ccacccacgg ccgatacaag ggaagtgtgg aattcaggaa tggacaactg 480 
gtcgtggaca accatgagat ctctgtctac cagtgcaaag agcccaaaca gatcccctgg 540 
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agggctgtcg ggagccccta cgtggtggag tccacaggcg tgtacctctc catacaggca 600 

gcttcggacc acatctctgc aggtgctcaa cgtgtggtca tctccgcgcc ctcaccggat 660 

gcaccaatgt tcgtcatggg tgtcaatgaa aatgactata accctggctc catgaacatt 72 0 

gtgagcaacg cgtcctgcac caccaactgt ttggctcccc tcgccaaagt catccacgag 7 80 

cgatttggga tcgtggaagg gttgatgacc acagtccatt cctacacggc cacccagaag 840 

acagtggacg ggccatcaag gaaggcctgg cgagatgggc ggggtgccca ccagaacatc 90 0 

atcccagcct ccactggggc tgcgaaagct gtgaccaaag tcatcccaga gctcaaaggg 960 

aagctgacag ggatggcgtt ccgggtacca accccggatg tgtctgtcgt ggacctgacc 102 0 

tgccgcctcg cccagcctgc cccctactca gccatcaagg aggctgtaaa agcagcagcc 10 80 

aaggggccca tggctggcat ccttgcctac accgaggatg aggtcgtctc tacggacttc 114 0 

ctcggtgata cccactcgtc catcttcgat gctaaggccg gcattgcgct caatgacaat 12 0 0 

ttcgtgaagc tcatttcatg gtacgacaac gaatatggct acagtcaccg ggtggtcgac 12 60 

ctcctccgct acatgttcag ccgagacaag tgaaacggga aggtcctttc tttccttccc 1320 

aggggccggg gccggaacat gtgcctcccg ttccagcatc tggctgcccg ggggaggaag 13 80 

gacacccggg gcgggcgccc cacgccgatg ggtccatggt gaaataaaaa acagtgctcg 1440 
aaaaaaaaaa aaaaa 1455 



<210> 244 

<211> 60 

<212> DNA 

<213> Homo sapiens 



<300> 

<308> NM_014364 



<400> 244 



cgctcaatga caatttcgtg aagctcattt catggtacga caacgaatat ggctacagtc 



<210> 245 
<211> 935 
<212> DNA 
<213> Homo sapiens 



60 



<300> 

<308> NM_014462 



<400> 245 

gaagtgggta agggtaatat ggaggagctt ccggcaggcc ccggcggctg aaagccgggg 60 

cagaagtgct ggtctcggtc gggattccgg gcttggtccc accgaggcgg cgactgcggt 12 0 

aggagggaag aggttttgga cgcgctggcc tcccgccgct gtgcattgca gcattatttc 180 

agttcaaaat gaactatatg cctggcaccg ccagcctcat cgaggacatt gacaaaaagc 240 

acttggttct gcttcgagat ggaaggacac ttataggctt tttaagaagc attgatcaat 3 00 

ttgcaaactt agtgctacat cagactgtgg agcgtattca tgtgggcaaa aaatacggtg 36 0 

atattcctcg agggattttt gtggtcagag gagaaaatgt ggtcctacta ggagaaatag 42 0 

acttggaaaa ggagagtgac acacccctcc agcaagtatc cattgaagaa attctagaag 48 0 

aacaaagggt ggaacagcag accaagctgg aagcagagaa gttgaaagtg caggccctga 540 

aggaccgagg tctttccatt cctcgagcag atactcttga tgagtactaa tcttttgccc 600 

agaggctgtt ggctcttgaa gagtaggggc tgtcactgag tgaaagtgac atcctggcca 6 60 

cctcacgcat ttgatcacag actgtagagt tttgaaaagt cacttttatt tttaattatt 72 0 

ttacatatgc aacatgaaga aatcgtgtag gtgggttttt tttttaataa caaaatcact 7 80 

gtttaaagaa acagtggcat agactccttc acacatcact gtggcaccag caactacttc 840 

tttatattgt tcttcatatc ccaaattaga gtttacaggg acagtcttca tttacttgta 9 00 
aataaaatat gaatctcaaa aaaaaaaaaa aaaaa 93 5 



<210> 246 

<211> 60 

<212> DNA 

<213> Homo sapiens 



<300> 

<308> NM_014462 
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ttaataacaa aatcactgtt taaagaaaca gtggcataga ctccttcaca catcactgtg 6 0 

<210> 247 
<211> 890 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> NM_014501 



<400> 247 

ggcggaccga agaacgcagg aagggggccg 
tgaactccaa cgtggagaac ctacccccgc 
cgacactgac cgcagaccca cccgatggca 
ccgacctcca ggtcaccatc gagggccctg 
gcatgaaact cctgctgggg aaggacttcc 
ccaagatctt ccacccgaac gtgggcgcca 
gggactggac ggctgagctg ggcatccgac 
tccaccctaa ccccgagtct gcactcaacg 
acgaggagta tgcggctcgg gcccgtctgc 
ccagcggcag ggccgaagcc ggtcgggccc 
accctggggc cccagggggc ccgggagggg 
gcgagcgcga taagaagctg gcggccaaga 
cgctgcggcg gctgtagtgg gctctcttcc 
cccctccctc caactctgtc tctaagttat 
agggggcact gggacctgga tttgtttttc 



gggggacccg cccccggccg gccgcagcca 60 

acatcatccg cctggtgtac aaggaggtga 120 

tcaaggtctt tcccaacgag gaggacctca 180 

aggggacccc atatgctgga ggtctgttcc 240 

ctgcctcccc acccaagggc tacttcctga 3 00 

atggcgagat ctgcgtcaac gtgctcaaga 3 60 

acgtactgct gaccatcaag tgcctgctga 420 

aggaggcggg ccgcctgctc ttggagaact 480 

tcacagagat ccacgggggc gccggcgggc 540 

tggccagtgg cactgaagct tcctccaccg 600 

ctgagggtcc catggccaag aagcatgctg 66 0 

aaaagacgga caagaagcgg gcgctgcggg 72 0 

tccttccacc gtgaccccaa cctctcctgt 780 

ttaaattatg gctggggtcg gggagggtac 840 
taaataaagt tggaaaagca 890 



<210> 248 

<211> 60 

<212> DNA 

<213> Homo sapiens 



<300> 

<308> NM_014501 



<400> 248 

acacgtactg ctgaccatca agtgcctgct gatccaccct aaccccgagt ctgcactcaa 

<210> 249 
<211> 1182 
<212> DNA 



60 



<213> Homo sapiens 



<300> 

<308> NM_016095 



<400> 249 

gcggccggcg gcgtctcctc ccgggacgct 
cctgcagctc gcgccgccat ggacgctgcc 
gttaccatta tccccaactt cagtctggac 
ccttttaacc ctggtttacc cgtggaagtg 
agacagaaat gtcgcctgct ccctccagag 
agggatcatg aacgaaagga agaaactttt 
cttacgaagc tcctgttaaa tcatgcttca 
accctggtca aggatatgtg ggacactcgt 
tttgtgagac agcaggaggc acatgccaag 
accagcggga ctttcctcac acaagcgctc 



gaggggcccg aggagaccgt gaggctctgg 60 

gaggtcgaat tcctcgccga gaaggagctg 12 0 

aagatctacc tcatcggggg ggacctgggg 180 

cccctgtggc tggcgattaa cctgaaacaa 240 

tggatggatg tagaaaagtt ggagaagatg 3 00 

accccaatgc ccagccctta ctacatggaa 3 60 

gacaacatcc cgaaggcaga cgaaatccgg 42 0 

atagccaaac tccgagtgtc tgctgacagc 480 

ctggataact tgaccttgat ggagatcaac 540 

aaccacatgt acaaactccg cacgaacctc 600 
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cagcctctgg agagtactca gtctcaggac ttctagagaa aggcctggtg caggcggctt 
gctgggggat gtgagcgctc aggatgtgat gaggtactcg tggttctgga gctctagaaa 
cacttctgat gcatgaaaaa tgtgtgatgg tgcaaggaat ggattcagga tgttgttgga 
gaaacaagtt tgtgattagt ccttaaaact tagctccctg ggacattctt caattccaca 
tctgtttcta gaaaccagcc ctttttcccc ccacttttga gaaataaaaa agccttaggt 
aaataagtca ttctccctag cagagccact tgggtctcct gcatggaagc cgtcacactt 
gggcaggtgt tcagtgactg gtaggtgtag atacagcagg agtggccatg tggtccacgg 
ctttttaccc cttcttgatc ctgatttctt gggctgaatt tagactctct cacagaggtg 
gctcacagag aaggatggca gatggtgcag ccaacaatgc tgaccggtgc ttatcctcta 
agccctgatc cacaataaaa atggacccaa ctcaaaaaaa aa 1182 

<210> 250 
<211> 60 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> NM_016095 
<400> 250 

atggattcag gatgttgttg gagaaacaag tttgtgatta gtccttaaaa cttagctccc 6 

<210> 251 
<211> 704 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> NM_016185 
<400> 251 

tgcagcggtg gtcggctgtt gggtgtggag tttcccagcg cccctcgggt ccgacccttt 
gagcgttctg ctccggcgcc agcctacctc gctcctcggc gccatgacca caaccaccac 
cttcaaggga gtcgacccca acagcaggaa tagctcccga gttttgcggc ctccaggtgg 
tggatccaat ttttcattag gttttgatga accaacagaa caacctgtga ggaagaacaa 
aatggcctct aatatctttg ggacacctga agaaaatcaa gcttcttggg ccaagtcagc 
aggtgccaag tctagtggtg gcagggaaga cttggagtca tctggactgc agagaaggaa 
ctcctctgaa gcaagctccg gagacttctt agatctgaag ggagaaggtg atattcatga 
aaatgtggac acagacttgc caggcagcct ggggcagagt gaagagaagc ccgtgcctgc 
tgcgcctgtg cccagcccgg tggccccggc cccagtgcca tccagaagaa atccccctgg 
cggcaagtcc agcctcgtct tgggttagct ctgactgtcc tgaacgctgt cgttctgtct 
gtttcctcca tgcttgagaa ctgcacaact tgagcctgac tgtacatctt cttggatttg 
tttcattaaa aagaagcact ttatgtaaaa aaaaaaaaaa aaaa 704 

<210> 252 
<211> 60 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> NM_016185 
<400> 252 

tgaaccaaca gaacaacctg tgaggaagaa caaaatggcc tctaatatct ttgggacacc 

<210> 253 
<211> 2268 
<212> DNA 

<213> Homo sapiens 
<220> 



660 
720 
780 
840 
900 
960 
1020 
1080 
1140 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
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<221> Modif ied__base 
<222> 1 . . . 2268 
<223> n = a, c,g, or t 

<300> 

<308> NM_016359 



<400> 253 
gggatttgaa 
gaatcgcgat 
agaacttagc 
ccttgaaagg 
aaacttctgc 
agagacagcc 
accctgactc 
atcatgaaaa 
acgagcacca 
cagaaggaaa 
ctgcaatcac 
ccattgatca 
actgaagcag 
actctctgtg 
tgcaagtcag 
taaaacgggt 
caagactcca 
ggctgtgctt 
cccattcaag 
tgatcttaaa 
accatggggg 
ctacaagaaa 
cgagcaagaa 
ggctgaagat 
cttttgtaaa 
ctaactgttc 
gacgttatca 
ttcaggtttc 
ccaagttccc 
atttacctga 
gggtaagctg 
aaacggggcc 
cttatgtacc 
aaaaataaaa 
gaaatgcgag 
ctggtctcaa 
cttagtatca 
agatcgtcat 



ccncgctgac 
gatcatcccc 
caagagtctg 
ctacattaaa 
atcctcttgt 
acttggccat 
acagcagaat 
gcaggaaagc 
agaagctgag 
gaaatctctc 
tactccaaac 
atatattgag 
cagcccatca 
gcttctactc 
agtaccttgg 
gtcaggtttt 
gccagaaagt 
gggacacaca 
ttgacaactg 
gcaagtttgt 
caatctaaag 
acttacaaac 
cgaaaggaga 
taataatttt 
tttttttttt 
atagtctgtg 
ccttaaagct 
taacgagacc 
cctgctggtt 
tggcagctag 
ggatagaaag 
atatagtttg 
ttcgttcaaa 
ctctacaact 
gattaatgct 
actccaccaa 
tgtccatgtt 
aaataaaatt 



gaagtttggt 
tctctagagg 
ggtctccggg 
catgaggcaa 
gatgagactg 
gtcaccaaaa 
cattcagaga 
caggatctca 
aatgctgttt 
tacacagatg 
tttaagaagc 
agaaaaagaa 
ataagggagg 
ccatcagcca 
gtctgaaggg 
cagctgctac 
ctgcacatgt 
aattaaagac 
aggcaacgca 
ctcgtcccct 
aaaataatta 
aaccccatct 
agaaagcaaa 
ttaatatctt 
tttgctgtca 
tagtgtccat 
caaattcttt 
catcctaaaa 
ctaatattaa 
ttatgcaagc 
gccacctctt 
gttatgacat 
tatcctcatg 
cattctaaca 
ttaatgcttt 
atgtacttct 
tccttcacct 
ttttctcatt 



gatccatctt ccgagtatcg ccgggatttc 60 
agctggactc cctcaagtac agtgacctgc 120 
ccaacctgag ggcaaccaag ttgttaaaag 180 
gaaaaggaaa tgagaatcag gatgaaagtc 24 0 
agatacagat cagcaaccag gaagaagctg 300 
caaggagaag gtgcaagact gtccgtgtgg 36 0 
taaaaataag taatcccact gaatfcccaga 42 0 
gagctactgc aaaagttcct tctccaccag 480 
cctcaggtaa cagagattca aaggtacctt 540 
agtcatccaa acctggaaaa aataaaagaa 600 
ttcatgaagc tcattttaag gaaatggagt 660 
acattttgaa gaacacaatt ccatgaatga 72 0 
ggtcaggact ccagtacctc caagaggaag 7 80 
acgacgctcg caaggccggt cttgtggccc 840 
gtcactcaag cgctctgcta tctctgcagc 9 00 
taaagataat gagcataagc gttcactgac 960 
gaccgtgtct gggggcaccc caaaaggcga 1020 
catcacgggg aattctgctg ctgttattac 1080 
gactccagtc tccaataaga aaccagtgtt 1140 
caactatgaa ccacacaaag gaaagctaaa 1200 
tctaaatcaa catgtcaaca gaattaactt 12 60 
ccagacaaag gaagagcaac ggaagaaacg 132 0 
ggttttggga atgcgaaggg gcctcatttt 13 80 
gtaaatattc ctgtattctc aacttttttc 144 0 
tccccacttt agtcacgaga tctttttctg 1500 
gggttcttca tgtgctatga tctctgaaaa 1560 
gggatggttt ttacttaagt ccattaacaa 162 0 
ttctgtttct agatttttaa tgtcaagttc 1680 
cagaactgca gtcttctgct agccaatagc 1740 
ttcaggagaa tttgaacaat aacaagaata 180 0 
cactctctat agaatatagt aacctttatg 186 0 
caatatttta cctaggtgaa attgtttagg 192 0 
taattgccat ctgtcactca ctatattcac 19 80 
ttgcttactt aaaagctaca tagccctatc 2 040 
tagagacagg gtctcactgt gttgcccagg 2100 
tattcatttt atggaaaaga ctaggctttg 2160 
cagtggagct tctgagtttt atactgctca 222 0 
gtcaaaaaaa aaaaaaaa 2268 



<210> 254 

<211> 60 

<212> DNA 

<213> Homo sapiens 



<300> 

<308> NM_016359 
<400> 254 

acattgctta cttaaaagct acatagccct atcgaaatgc gaggattaat gctttaatgc 6 0 

<210> 255 
<211> 1590 
<212> DNA 
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<213> Homo sapiens 



<300> 

<308> NM_016816 



<400> 255 

gaggcagttc tgttgccact ctctctcctg tcaatgatgg atctcagaaa taccccagcc 6 0 

aaatctctgg acaagttcat tgaagactat ctcttgccag acacgtgttt ccgcatgcaa 12 0 

atcgaccatg ccattgacat catctgtggg ttcctgaagg aaaggtgctt ccgaggtagc 180 

tcctaccctg tgtgtgtgtc caaggtggta aagggtggct cctcaggcaa gggcaccacc 2 40 

ctcagaggcc gatctgacgc tgacctggtt gtcttcctca gtcctctcac cacttttcag 3 00 

gatcagttaa atcgccgggg agagttcatc caggaaatta ggagacagct ggaagcctgt 3 60 

caaagagaga gagcactttc cgtgaagttt gaggtccagg ctccacgctg gggcaacccc 42 0 

cgtgcgctca gcttcgtact gagttcgctc cagctcgggg agggggtgga gttcgatgtg 480 

ctgcctgcct ttgatgccct gggtcagttg actggcagct ataaacctaa cccccaaatc 540 

tatgtcaagc tcatcgagga gtgcaccgac ctgcagaaag agggcgagtt ctccacctgc 6 00 

ttcacagaac tacagagaga cttcctgaag cagcgcccca ccaagctcaa gagcctcatc 660 

cgcctagtca agcactggta ccaaaattgt aagaagaagc ttgggaagct gccacctcag 720 

tatgccctgg agctcctgac ggtctatgct tgggagcgag ggagcatgaa aacacatttc 7 80 

aacacagccc aaggatttcg gacggtcttg gaattagtca taaactacca gcaactctgc 840 

atctactgga caaagtatta tgactttaaa aaccccatta ttgaaaagta cctgagaagg 900 

cagctcacga aacccaggcc tgtgatcctg gacccggcgg accctacagg aaacttgggt 96 0 

ggtggagacc caaagggttg gaggcagctg gcacaagagg ctgaggcctg gctgaattac 1020 

ccatgcttta agaattggga tgggtcccca gtgagctcct ggattctgct ggctgaaagc 1080 

aacagtacag acgatgagac cgacgatccc aggacgtatc agaaatatgg ttacattgga 1140 

acacatgagt accctcattt ctctcataga cccagcacgc tccaggcagc atccacccca 12 0 0 

caggcagaag aggactggac ctgcaccatc ctctgaatgc cagtgcatct tgggggaaag 12 6 0 

ggctccagtg ttatctggac cagttccttc attttcaggt gggactcttg atccagagaa 13 2 0 

gacaaagctc ctcagtgagc tggtgtataa tccaagacag aacccaagtc tcctgactcc 13 80 

tggccttcta tgccctctat cctatcatag ataacattct ccacagcctc acttcattcc 1440 

acctattctc tgaaaatatt ccctgagaga gaacagagag atttagataa gagaatgaaa 1500 

ttccagcctt gactttcttc tgtgcacctg atgggagggt aatgtctaat gtattatcaa 15 60 
taacaataaa aataaagcaa ataccaaaaa 159 0 

<210> 256 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM„016816 



60 



<400> 256 , t ^ 

cgatcccagg acgtatcaga aatatggtta cattggaaca catgagtacc ctcatttctc 

<210> 257 

<211> 2905 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_016817 

<400> 257 t cn 

cggcagccag ctgagagcaa tgggaaatgg ggagtcccag ctgtcctcgg tgcctgctca 60 

gaagctgggt tggtttatcc aggaatacct gaagccctac gaagaatgtc agacactgat 120 

cgacgagatg gtgaacacca tctgtgacgt ctgcaggaac cccgaacagt tccccctggt 180 

gcagggagtg gccataggtg gctcctatgg acggaaaaca gtcttaagag gcaactccga 2 40 

tggtaccctt gtccttttct tcagtgactt .aaaacaattc caggatcaga agagaagcca 3 00 

acgtgacatc ctcgataaaa ctggggataa gctgaagttc tgtctgttca cgaagtggtt 3 60 
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gaaaaacaat ttcgagatcc agaagtccct tgatgggtcc accatccagg tgttcacaaa 420 

aaatcagaga atctctttcg aggtgctggc cgccttcaac gctctgagct taaatgataa 480 

tcccagcccc tggatctatc gagagctcaa aagatccttg gataagacaa atgccagtcc 540 

tggtgagttt gcagtctgct tcactgaact ccagcagaag ttttttgaca accgtcctgg 600 

aaaactaaag gatttgatcc tcttgataaa gcactggcat caacagtgcc agaaaaaaat 660 

caaggattta ccctcgctgt ctccgtatgc cctggagctg cttacggtgt atgcctggga 720 

acaggggtgc agaaaagaca actttgacat tgctgaaggc gtcagaacgg ttctggagct 780 

gatcaaatgc caggagaagc tgtgtatcta ttggatggtc aactacaact ttgaagatga 840 

gaccatcagg aacatcctgc tgcaccagct ccaatcagcg aggccagtaa tcttggatcc 900 

agttgaccca accaataatg tgagtggaga taaaatatgc tggcaatggc tgaaaaaaga 960 

agctcaaacc tggttgactt ctcccaacct ggataatgag ttacctgcac catcttggaa 102 0 

tgtcctgcct gcaccactct tcacgacccc aggccacctt ctggataagt tcatcaagga 1080 

gtttctccag cccaacaaat gcttcctaga gcagattgac agtgctgtta acatcatccg 1140 

tacattcctt aaagaaaact gcttccgaca atcaacagcc aagatccaga ttgtccgggg 12 0 0 

aggatcaacc gccaaaggca cagctctgaa gactggctct gatgccgatc tcgtcgtgtt 12 6 0 

ccataactca cttaaaagct acacctccca aaaaaacgag cggcacaaaa tcgtcaagga 132 0 

aatccatgaa cagctgaaag ccttttggag ggagaaggag gaggagcttg aagtcagctt 13 80 

tgagcctccc aagtggaagg ctcccagggt gctgagcttc tctctgaaat ccaaagtcct 144 0 

caacgaaagt gtcagctttg atgtgcttcc tgcctttaat gcactgggtc agctgagttc 1500 

tggctccaca cccagccccg aggtttatgc agggctcatt gatctgtata aatcctcgga 15 60 

cctcccggga ggagagtttt ctacctgttt cacagtcctg cagcgaaact tcattcgctc 162 0 

ccggcccacc aaactaaagg atttaattcg cctggtgaag cactggtaca aagagtgtga 1680 

aaggaaactg aagccaaagg ggtctttgcc cccaaagtat gccttggagc tgctcaccat 17 40 

ctatgcctgg gagcagggga gtggagtgcc ggattttgac actgcagaag gtttccggac 1800 

agtcctggag ctggtcacac aatatcagca gctcggcatc ttctggaagg tcaattacaa 1860 

ctttgaagat gagaccgtga ggaagtttct actgagccag ttgcagaaaa ccaggcctgt 192 0 

gatcttggac ccaggcgaac ccacaggtga cgtgggtgga ggggaccgtt ggtgttggca 19 80 

tcttctggac aaagaagcaa aggttaggtt atcctctccc tgcttcaagg atgggactgg 2 040 

aaacccaata ccaccttgga aagtgccgac aatgcagaca ccaggaagtt gtggagctag 2100 

gatccatcct attgtcaatg agatgttctc atccagaagc catagaatcc tgaataataa 2160 

ttctaaaaga aacttctgga gatcatctgg caatcgcttt taaagactcg gctcaccgtg 2220 

agaaagagtc actcacatcc attcttccct tgatggtccc tattcctcct tcccttgcct 22 80 

tcttggactt cttgaaatca atcaagactg caaacccttt cataaagctg ccttgctgaa 2340 

ctcctctctg caggagccct gcttaaaata gttgatgtca tcactttatg tgcatcttat 2400 

ttctgtcaac ttgtattttt ttttcttgta tttttccaat tagctcctcc tttttccttc 2460 

cagtctaaaa aaggaatcct ctgtgtcttc aaagcaaagc tctttacttt ccccttggtt 252 0 

ctcataactc tgtgatcttg ctctcggtgc ttccaactca tccacgtcct gtctgtttcc 2580 

tctgtataca aaaccctttc tgcccctgct gacacagaca tcctctatgc cagcagccag 2 640 

gccaaccctt tcattagaac ttcaagctct ccaaaggctc agattataac tgttgtcata 2 700 

tttatatgag gctgttgtct tttccttctg agcctgcctt tatcccccca cccaggagta 2760 

tcctcttgcc aaagcaaaag actttttcct tggctttagc cttaaagata cttgaaggtc 2 82 0 

taggtgcttt aacctcacat accctcactt aaacttttat cactgttgca tataccagtt 2 880 
gtgatacaat aaagaatgta tctgg 2 9 05 

<210> 258 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> 3SIM_016817 
<400> 258 

aaggtctagg tgctttaacc tcacataccc tcacttaaac ttttatcact gttgcatata 60 



<210> 259 

<211> 2054 

<212> DNA 

<213> Homo sapiens 
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360 
420 
480 



780 
840 
900 
960 



<300> 

<308> NM_017414 
<400> 259 

gggaagctcg ggccggcagg gtttccccgc acgctggcgc ccagctcccg gcgcggaggc 60 
cgctgtaagt ttcgctttcc attcagtgga aaacgaaagc tgggcggggt gccacgagcg 120 
cggggccaga ccaaggcggg cccggagcgg aacttcggtc ccagctcggt ccccggctca 180 
gtcccgacgt ggaactcagc agcggaggct ggacgcttgc atggcgcttg agagattcca 240 
tcgtgcctgg ctcacataag cgcttcctgg aagtgaagtc gtgctgtcct gaacgcgggc 3 00 
caggcagctg cggcctgggg gttttggagt gatcacgaat gagcaaggcg tttgggctcc 
tgaggcaaat ctgtcagtcc atcctggctg agtcctcgca gtccccggca gatcttgaag 
aaaagaagga agaagacagc aacatgaaga gagagcagcc cagagagcgt cccagggcct 
gggactaccc tcatggcctg gttggtttac acaacattgg acagacctgc tgccttaact 540 
ccttgattca ggtgttcgta atgaatgtgg acttcaccag gatattgaag aggatcacgg 600 
tgcccagggg agctgacgag cagaggagaa gcgtcccttt ccagatgctt ctgctgctgg 660 
agaagatgca ggacagccgg cagaaagcag tgcggcccct ggagctggcc tactgcctgc 720 
agaagtgcaa cgtgcccttg tttgtccaac atgatgctgc ccaactgtac ctcaaactct 
ggaacctgat taaggaccag atcactgatg tgcacttggt ggagagactg caggccctgt 
atacgatccg ggtgaaggac tccttgattt gcgttgactg tgccatggag agtagcagaa 
acagcagcat gctcaccctc ccactttctc tttttgatgt ggactcaaag cccctgaaga 
cactggagga cgccctgcac tgcttcttcc agcccaggga gttatcaagc aaaagcaagt 1020 
gcttctgtga gaactgtggg aagaagaccc gtgggaaaca ggtcttgaag ctgacccatt 1080 
tgccccagac cctgacaatc cacctcatgc gattctccat caggaattca cagacgagaa 114 0 
agatctgcca ctccctgtac ttcccccaga gcttggattt cagccagatc cttccaatga 12 00 
agcgagagtc ttgtgatgct gaggagcagt ctggagggca gtatgagctt tttgctgtga 
ttgcgcacgt gggaatggca gactccggtc attactgtgt ctacatccgg aatgctgtgg 
atggaaaatg gttctgcttc aatgactcca atatttgctt ggtgtcctgg gaagacatcc 
agtgtaccta cggaaatcct aactaccact ggcaggaaac tgcatatctt ctggtttaca 144U 
tgaagatgga gtgctaatgg aaatgcccaa aaccttcaga gattgacacg ctgtcatttt 1500 
ccatttccgt tcctggatct acggagtctt ctaagagatt ttgcaatgag gagaagcatt 1560 
gttttcaaac tatataactg agccttattt ataattaggg atattatcaa aatatgtaac 1620 
catgaggccc ctcaggtcct gatcagtcag aatggatgct ttcaccagca gacccggcca 1680 
tgtggctgct cggtcctggg tgctcgctgc tgtgcaagac attagccctt tagttatgag 1740 
cctgtgggaa cttcaggggt tcccagtggg gagagcagtg gcagtgggag gcatctgggg 1800 
gccaaaggtc agtggcaggg ggtatttcag tattatacaa ctgctgtgac cagacttgta 1860 
tactggctga atatcagtgc tgtttgtaat ttttcacttt gagaaccaac attaattcca 1920 
tatgaatcaa gtgttttgta actgctattc atttattcag caaatattta ttgatcatct 1980 
cttctccata agatagtgtg ataaacacag tcatgaataa agttattttc cacaaaaaaa 2 040 
aaaaaaaaaa aaaa 2 054 

<210> 260 
<211> 60 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> NM_017414 



1260 
1320 
1380 



<400> 260 

tgagcatctc ttctccataa gatagtgtga taaacacggt catgaataaa gttattttcc 



60 



<210> 261 

<211> 3638 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_017523 



<400> 261 

ggtagatgcg gctgtgacag cagcaaagaa tgacggccaa gggcgacagc aggggctggc 



60 
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catgctgtaa 
ccgagaattc 
tcagttttgt 
acatggaagg 
tcaccctcca 
ctgtccccaa 
cgatgtgtca 
aggagcgccc 
tccacgagtc 
tgcaccgcat 
aaggggaaag 
ttccagaaaa 
aacactttcc 
aaaatcaaac 
ttcctcttca 
atccactttt 
cctatgacat 
atcaacatca 
tcagctagat 
gcattcctgc 
tttaaaagaa 
agaaaaggct 
cgtcgaagga 
catactgaat 
cctattcaac 
aggcaaccaa 
ctagaaaacc 
ctttcaggag 
caagctgaga 
tacctaggaa 
ctgctgaaag 
gaatcaatat 
ctatcaaact 
tgaaaccaaa 
ggcatcgcat 
tactggtaca 
agagggccca 
aaatacgcaa 
catatgcaga 
atggattaaa 
caataccatt 
tgtaacgaaa 
gcaatcccat 
tatacacaaa 
tgtccatcag 
tgtaatccca 
accagcctgg 
gtggtggcga 
acctggaagg 
ccaagcgaga 
atactatgca 
ggaagccatt 
acttataagt 
ctggggccta 
ggtactatgc 
aatttaccta 
aaaacgtttg 
aataaaccaa 
tgggccctac 
caaagaataa 



aggggcttct 
tgaagatctc 
ttccttgcct 
agacttctcg 
tgaggcttac 
ggaaaccatg 
gcagagcatg 
tgttgagtgt 
ctactgtggc 
gctcgcccag 
aatttcagct 
taagtatttc 
tgttggaaat 
ttccacgatg 
ttctgaaagt 
gatgtcagag 
tctgaggaga 
ggagaaatgc 
ttggaaaagg 
ctacttgctg 
aaggtttggc 
tttgataaaa 
acatacctca 
gagcaaaagc 
atagtactgg 
aaagaaagga 
ccatgatctt 
acaaaatcaa 
gccaaatcaa 
tacagctaac 
aaatcagaga 
tgttaaaatg 
accaataaca 



aaagagccca 
tacccaactt 
aaagcatggt 
gaaataaagc 
tggggaaaga 
ggattgaaac 
gacttaaatg 
ctggacatag 
gcaaaaattg 
tattggttat 
tgttcacggc 
tggtagaaag 
gcactttggg 
ccaacatggc 
gcacctgtca 
tggtggttgc 
ctctgcctta 
gccataaaaa 
atcctaaatg 
agaagctaaa 
cctgaggtcg 
tttttatctg 
tatagcaagc 
gattattttc 
cgggaaaaaa 
agcctggaga 
aaacacctct 



tgggagggtc 
ctccctccct 
gcaagaaacg 
gtgtgcagga 
tgcctgcggt 
gaggagcact 
cagaagtcct 
aagttctgca 
agccggacag 
cacagagatg 
cctgaaaggg 
caccatatgg 
ccagaaattc 
gagaaagatg 
tcatcaaaga 
cccaagccca 
tgttctcagt 
cggtggttag 
aaaggtacta 
tggtggtctt 
agaactaaaa 
ttcaacttga 
aaataataag 
tggagcatta 
aagtcctcgc 
agtcgaagta 
ggcccaaaag 
tatacaaaat 
gaatgcaatc 
cagggagatg 
tgacacaaat 
gccatactac 
ttcttcacag 
aaaagccaaa 
caaactatac 
gctggtacaa 
tacacaccta 
attccccatt 
tgaaccactt 
taaaacccca 
gaacggaaaa 
acaaatggga 
atacccaaag 
agcactatac 
gataaagaaa 
aggctgaggc 
gaaactccgt 
tcccagctac 
agtgagccga 
aaaaaaaaaa 
agaatgggat 
aactcactca 
cattgagtac 
gagcatggaa 
gatgatgaaa 
ctacacatgt 
cctctttcga 
gaaaggttcc 
acctggagaa 
ctaaaaaaaa 



cagcctcagg aatcaagggg 
gaagctgtgg gctgggccat 
aaactcaacc gaaagcctgc 
actgtaaaag acatgtagtc 
tcctggtcct gtgtccggag 
gcaagcttga gcaccagcag 
cgctggagtt tcataaggcc 
aactggacat gcagctcagc 
agctctgcca aggctgtggc 
tctgtcgcag tgaacaggcc 
aaatctactg tcattattgc 
gtaaatgttg tccagactca 
ttccttcatc tcttccaagt 
ttcgtccaaa gacaagaagt 
aagcaccaag aagcaaaaac 
ggaccagctc ccctagagga 
gtggcatcct gcttcccctg 
cttcatcaaa aggaaaacaa 
caaattcaaa agatttcact 
gtgaaaggtg atgggtttta 
acaaaactca cgtatcatct 
cttcatgtta aaaaccctca 
agccatctat gacaaaacca 
ctcttgagaa gtagaacaag 
cacagcaatc aggcaagaga 
tctctgtttg cagacgatat 
ctcctagatc tgataaacaa 
atggtagcat ttttatacac 
ctattcacaa ttgccacaaa 
aaagatctct acaacaaaaa 
ggaaaaacat tccatactta 
ccaaagcaat ttatagattc 
aatcagaaaa aaaaagcatt 
gcaatcctaa gcaaaaagaa 
tacagggcta cagtaaccaa 
aagcagacac atagatcaat 
caaccatcta atctttgaca 
cagtaagtgg tactgggata 
ccttacacca tatgcaaaaa 
aactataaaa actctggaag 
agatttcatg acaaagatcc 
catgattaaa cagaattacc 
gaatctaaat cattctgtca 
acaatcgcaa agtcagggaa 
atgtggtggc agggagtggt 
gggtggttca cctgaggtca 
ctccgctaaa aatacgaaaa 
ttgggaggcc taggcgtgag 
gatcctgcca ctgcactcca 
aaagaaaatg tggcacatat 
catgtcctgt gcagcaacgt 
gaaacagaaa accaaatacc 
acatggatac aaagaaggga 
ggagggtgag gatcaaaaaa 
taatctgtac aacaaaccct 
gcccctgaac ctaaaaaaaa 
acaaagacat tggtttgccc 
agttttgtct gaaaattctg 
tcctacaccc acagaacccg 
aaaaaaaa 3 63 8 



aactcctgag 
cggaaaactt 
agagagcaga 
tctgccaact 
tgtgaggagc 
gttgggtgta 
aatgagtgcc 
aagctggagc 
cagttcatca 
cagctcggga 
aaccaaatga 
gagtttaaga 
caagctgctg 
ataaacagat 
aaaaccttgg 
gataaagcag 
ccgatcctaa 
gtgagaaatt 
tttaacactg 
ttcgttgggc 
caatagatac 
acaaaccagg 
cagccaacat 
gcacttcagt 
aagaaataaa 
gattctatat 
cttcagctaa 
caacgacatc 
aagaataaaa 
ttacaaaaca 
tggataggaa 
aatgctattc 
aaaatttatt 
caaagctgga 
aactgcatga 
ggaacagaat 
aagttgacaa 
actagctagc 
tcaactcaag 
ataacctagg 
caaaaataat 
atttgactca 
taaagacata 
tcaaactaaa 
ggctcatgtc 
ggagtttgag 
ttagccaggc 
aatcgcttga 
gcctgggcaa 
acaccatgga 
ggatggagct 
acatgttctc 
accgcagaca 
ctacctatct 



ggtgacatgc 
agttaaaaga 
aaggactaca 
attaagcctc 
gctttgtccc 



120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
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<210> 262 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_017523 
<400> 262 

ttggaaaagg aaaggtacta caaattcaaa agatttcact tttaacactg gcattcctgc 

<210> 263 

<211> 2461 

<212> DNA 

<213> Homo sapiens 



<300> 

<308> N1YL018410 



<400> 263 
atgctgggta 
aagctcaggg 
aaccagccct 
cagggattga 
gactcctcca 
cctgagcttc 
gccacgtcag 
cctttgaaaa 
gcagagtatt 
ccttcactgg 
agtcctggtg 
tcctccacag 
agcagcagct 
agtgacctgt 
agcatcatct 
tataagagca 
aaggagaact 
aagaacgtat 
gaagtcaaca 
ccctcgaagt 
gaaaatagat 
acaatacgac 
cttcatcggg 
tcctgggcca 
gaaacccgca 
tttgaaaacc 
tcttcatcac 
gaccttcacg 
actctttcag 
aaagaagaat 
gtgcctttat 
gaaggcttct 
tcatcacccc 
tctacatgtg 
aaaagaccca 
tcagatgggg 
tcagaagaga 
gaaaaattgg 
cttgctctct 
gacttctctg 
ttcttggtgt 



cgctgcgcgc 
ccagtcgccg 
tcgaggacac 
gaatttgggg 
tgaagcccgc 
cctcgcaccg 
accaggaaga 
atgaattaag 
ttgagtgtgc 
cctcacctgc 
acccagcgaa 
acatggcctt 
tcttaagcag 
acgcagggat 
ccaccaaaac 
ggatgaacaa 
tcataccctg 
tagatgtttc 
gaccccaaat 
attcttcctt 
ttaggacatt 
agggccatgg 
aatattgcct 
tgaacatgta 
ggctgagttt 
taggcaaaag 
ttccaaagac 
ttcagggaaa 
tcccagataa 
ttgacaagct 
gtattggagt 
taggaaaatt 
tggggtgcag 
ttgctcgtgc 
ggctatcaga 
tggacaacac 
gaggagagaa 
aaactaaaag 
gtttgtattt 
cccttaaagt 
agaaattatg 



catggagggc 
ccgcttccag 
cccggtggtg 
tggaagacta 
ggacaggaca 
cacagtcctg 
gtcagttgct 
aaggaaatac 
aggtaacaga 
cgtgcctgcc 
accagcttca 
agtacctaga 
ccagcccttt 
gctgcactcc 
gttcatcatg 
aacatattgc 
ctctgagcct 
ttgccgtaag 
ccataagtta 
gatttacttc 
aaaatggtta 
agagaaccgt 
gagtcccagg 
cagagggggt 
accttccagc 
atctctggaa 
caaccccaca 
tagttctgga 
agaagtgcca 
tcatcaaaag 
gtctacagat 
aaatccagac 
aaaaagtcta 
catcacgagg 
accccagggc 
cgtcagaccg 
cacgtcttac 
tgtgtagcta 
ttgttttgtt 
aaatattagt 
tgaataaagt 



gaggacgtgg 
aggcgcatgc 
caaatggcca 
ataaaggaaa 
gatggctccg 
ggagccgatt 
tgggccttag 
ttgacccaag 
gctggaaggg 
cccggatact 
tctcccagag 
aatgacagcc 
gaagatgatg 
atgagccggc 
caaaactgga 
aaaggagcca 
gtgaaaggga 
acaggtttaa 
gatccaagtt 
gactccagtg 
atttctcctg 
cagagggaga 
aaccagcctc 
cctgcgagtc 
aaagcaaaag 
gcaggtaggt 
cacagcgcaa 
atatttagaa 
ggccacggaa 
tattgcctca 
aaagcaagta 
cctcacttcc 
ctgggctcaa 
gatggcacga 
tccggacgcc 
ggagaccagg 
aggatggaag 
ggttatttcg 
tttgattctt 
gaaattggtt 
tgctcaatta 



aagacgacca 
agcggctgat 
cgctgaccta 
gaaacaaagg 
tgcaagctgc 
caaaaagcgg 
cacctgcagt 
tggatatact 
atgtacgtgt 
gcagtcgtat 
aatgggatcc 
tctccctaca 
acatttgcaa 
tgttgagcac 
actgcaggag 
gacgttctca 
caggggcatt 
aattggaaaa 
ggaaggagcg 
caacatataa 
taaaaatagt 
ttgaaatccg 
gccggatgtg 
ctggtggcct 
caaaaagttt 
gcctgcccaa 
ctcgcccgca 
agtcagtgtc 
ggaatcgtta 
aatctcctgg 
tggaagttcg 
agggtttcca 
ctgcaattga 
gggaccatca 
agggcaattc 
gcagctcttc 
agaaaagtga 
gagtgttatt 
gagactgtga 
ccatcagaga 
gaaaaaaaaa 



gctgctgcag 
agagaagtac 
cgagacgcca 
agagatccag 
agcctggggt 
tgaggtcgat 
gcctcaaagc 
gctacaaggt 
gactccgctg 
ctccggaaag 
tttgcatcct 
agagaccagt 
tgtgaccatc 
aaagccatca 
gaggcacaga 
gaggagctcc 
aagagattgc 
agcttttctt 
caaagtgaca 
tcttgatgag 
ttccagacca 
atttgatcag 
cctcccggac 
tcagggctta 
aagtgaggct 
gagcgattca 
gcagacatct 
acccagcaaa 
cgatgaaatt 
gcagatgaca 
atatcaaaca 
gaagttgcca 
ggctccttca 
gttccctgca 
cctgggtgcc 
acagcccaac 
tttcatgcta 
tatcttccca 
ggacttggtt 
taacctcgag 
aaaaaaaaaa 
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a 2461 

<210> 264 

<2H> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_018410 
<400> 264 

agtgatttca tgctagaaaa attggaaact aaaagtgtgt agctaggtta tttcggagtg 

<210> 265 

<211> 1405 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_018455 
<400> 265 

cacctcgctc gcagcctccc cagcgcagca gcccggctgt gggcctgcgg cagccgggtc 
ttcctggtcc ccacctcctg gggccgacgg gcggcaggaa ggggctcggc gggacgcgcc 
gtcagggacc tgaggaggaa caacggaacg cgttcggaac ggcctggact cccgagactc 
acccgactcg tggccacacc gggagaactg aagcggcagt agccggcgga gacgcccgac 
ccgaaggccg gctgctaggg agcagacagc tgaaccgctt gccagacgcc gaaacccagt 
gacgccctcc accgctccac cgtgctcccg gctccccgcc cccgccgccc gcgggcccca 
aggcgcatgc gccgcctgtc ctggaggggc ccatttccgt ccgtcgtggg gggaggcaca 
gtgagtccac tggggcacgg cagcgtctaa gccacaagcc gagcacataa gccaggtcct 
aacggagcct atgtgtaagt ccactactgg tgcaaggttg cacacttcta agaagagcgg 
cgtggggggc tcggcgacct tcgcttcagt cgctcccccg tgcagtcccc tgtgcccaag 
acacagcctg atgcttgtgc tccggtgggc ggagcttgga ggcggcggga actgcaattg 
gtggctttga aggcgcggcg agcgggaaca gctcttgagg agtgagactg caggagatgt 
gggccgtgcc aaagagatgg atgagactgt tgctgagttc atcaagagga ccatcttgaa 
aatccccatg aatgaactga caacaatcct gaaggcctgg gattttttgt ctgaaaatca 
actgcagact gtaaatttcc gacagagaaa ggaatctgta gttcagcact tgatccatct 
gtgtgaggaa aagcgtgcaa gtatcagtga tgctgccctg ttagacatca tttatatgca 
atttcatcag caccagaaag tttgggatgt ttttcagatg agtaaaggac caggtgaaga 
tgttgacctt tttgatatga aacaatttaa aaattcgttc aagaaaattc ttcagagagc 
attaaaaaat gtgacagtca gcttcagaga aactgaggag aatgcagtct ggattcgaat 
tgcctgggga acacagtaca caaagccaaa ccagtacaaa cctacctacg tggtgtacta 
ctcccagact ccgtacgcct tcacgtcctc ctccatgctg aggcgcaata caccgcttct 
gggtcaggag ttagaagcta ctgggaaaat ctacctccga caagaggaga tcattttaga 
tattaccgaa atgaagaaag cttgcaatta gtgaacatga aaggaaaata aaaattcctc 
acagtcaaaa aaaaaaaaaa aaaaa 1405 

<210> 266 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_018455 
<400> 266 

ccgacaagag gagatcattt tagatattac cgaaatgaag aaagcttgca attagtgaac 

<210> 267 
<211> 927 
<212> DNA 
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<213> Homo sapiens 
<300> 

<308> NM_018465 



<400> 268 

ggaacagagt agattcttca tagacaaatg aaatcatgct taccaatcaa atctcaaagc 

<210> 269 

<211> 1047 

<212> DNA 

<213> Homo sapiens 



60 

120 

180 

240 

300 



<400> 267 

ggcagcgggc gaaaggagcc ggggcctgga ggtttgcgta ccggtcgcct ggtcccggca 
ccagcgccgc ccagtgtggt ttcccataag gaagctcttc ttcctgcttg gcttccacct 
ttaacccttc cacctgggag cgtcctctaa cacattcaga ctacaagtcc agacccagga 
aagcaaggcc cagaaagagg tcaaaatggg gtttatattt tcaaaatcta tgaatgaaag 

catgaaaaat caaaaggagt tcatgcttat gaatgctcga cttcagctgg aaaggcagct 

catcatgcag agtgaaatga gggaaagaca aatggccatg cggattgcgt ggtctcggga 3 60 
attcctcaaa tattttggaa ctttttttgg ccttgcagcc atctctttaa cagctggagc 420 
gattaaaaaa aagaagccag ccttcctggt cccgattgtt ccattaagct ttatcctcac 480 
ctaccagtat gacttgggct atggaaccct tttagaaaga atgaaaggtg aagctgagga 540 
catactggaa acagaaaaga gtaaattgca gctgccaaga ggaatgatca cttttgaaag 6UU 
cattgaaaaa gccagaaagg aacagagtag attcttcata gacaaatgaa atcatgctta 660 
ccaatcaaat ctcaaagcac agaattattg acttgaatca tggtttttac agttttttaa 720 
atgctcaaga ttttgatatt atagatttta ttttaaaata ttaaaatgca agatagtttt 7 80 
gagctatttt aaaataaaat ttataacatt caacacaaaa tcatggaggt gctctaaata 
acttttagat ttcctctctc tgtgtgcatt accaatatct aagtgtaaaa ttaataaatt 
gttttgaatt cctggaaaaa aaaaaaa 927 

<210> 268 
<211> 60 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> NM__018465 



840 
900 



60 



<300> 

<308> NM_018487 
<400> 269 

cccacttctc cagccagcgc cccagccctc 
cagactgtgt ccctgacaat gggaacagcc 
cagcacaccc acatcgatgt gcacatccac 
acctgctgct ctgcgctgcg gccccgggcc 
gtggcctcgt gggtgatgca gatcgtgctg 
ttctacatcc gcgactacac cctcctcgtc 
gtggctgtgc tggctggagc tgctgccttc 
gccctgctga ggactctgct aacgctggca 
ctttggaatg aagatttccg atatggctac 
agctcgagtg actggaacac tccagccccc 
cacctatgta cctccttcat ggacatgctg 
ctcttgggtg tctggattct gctgcttctg 
tggagaatgt tcccaaccaa agggaaaaga 
atctagccat gcctctcctg attattagtg 
tctgactgct ggaagaagaa ccagactgag 
cctggcccca tgaccgtggc cacagccctg 



ccgccgcccg ctcgcaggtc ccgaggagcg 60 

gacagtgatg agatggcccc ggaggcccca 120 

caggagtctg ccctggccaa gctcctgctc 180 

acccaggcca ggggcagcag ccggctgctg 240 

gggatcttga gtgcagtcct aggaggattt 300 

acctcgggag ctgccatctg gacaggggct 360 

atttacgaga aacggggtgg tacatactgg 42 0 

gctttctcca cagccatcgc tgccctcaaa 480 

tcttattaca acagtgcctg ccgcatctcc 540 

actcagagtc cagaagaagt cagaaggcta 600 

aaggccttgt tcagaaccct tcaggccatg 660 

gcatctctga cccctctgtg gctgtactgc 72 0 

gaccagaagg aaatgttgga agtgagtgga 7 80 

cctggtgctt ctgcaccggg cgtccctgca 840 

gaaaagaggc tcttcaacag ccccagttat 9 00 

ctccagcagc acttgcccat tccttacacc 960 
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ccttccccat cctgctccgc ttcatgtccc ctcctgagta gtcatgtgat aataaactct 1C 
catgttattg ttcccaggaa aaaaaaa 1047 

<210> 270 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_018487 
<400> 270 

aaccaaaggg aaaagagacc agaaggaaat gttggaagtg agtggaatct agccatgcct 6 

<210> 271 
<211> 2280 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> U17077 



<400> 271 



ccgcccgcca 
atccctttcg 
gccgccaccc 
tcgtttctca 
gaatcctgga 
gctgccgtcc 
tactacatta 
catgccttca 
ttgaaagctc 
aacggaagat 
gacaccacac 
tttcctggga 
actcacatca 
gtttggggat 
ctgatctttt 
caaggctgga 
gccattctcc 
ggctaattta 
cgatctcctg 
tgagccaccg 
atctaacgca 
tggtcccgag 
ttcccctccc 
tgagggtaga 
ggggaatggg 
agcagctgcc 
cagaaagcac 
ccgcattgga 
ggaaccgcgc 
ctccgaggct 
acgggcaaag 
ttttctaaag 
tgcacgtgcc 
actggtttga 
cagggacaga 
tcactttctg 
aaaagcttct 



ccagctacgc 
ccttcttcct 
acatagtata 
tctccttgat 
gagttctgga 
tacaagtaca 
attcggcagc 
gcatctatta 
caattattgg 
ccactaaaac 
tttgtttgga 
aaacaactgt 
aagccctcac 
tttgcctcag 
ttttgtttgt 
gtgcagtgac 
tgcctcagcc 
gttgtatttt 
acctcgtgat 
tgcccggcct 
gatgcataga 
ttccctggga 
ctttcctctc 



cgtgtgggga 
gagggaaggg 
ggacgcctgg 
cctcagggaa 
agaaaaggct 
tgcgccacct 
ggcggaatgg 
ttgtgcacac 
aaagctgtgt 
ttctgtcatt 
ggccttacag 
actacaggag 
atcactctga 
atgtgtctct 



cccgtccgac 
gcccgagctg 
ccccttgctg 
gttcctgttg 
cagcctgtac 
tgccacgatt 
ctcgttcttc 
ccactgatgc 
tccccaaaag 
gtccacggga 
catttaaatt 
ctcttggaat 
tccactaatg 
ctatgggctt 
tttattttgt 
gcgatctcca 
tcccgagtag 
tagtagagat 
ccgcccgcct 
gattctctta 
ccttgttcct 
ggctgacagg 
cagcattagg 
aaataaatca 
catagcctgc 
gtcaggaatc 
aaaggtgaaa 
gccgcaacat 
gcactcaccc 
tggtgcccac 
taaaatatca 
tcttctgttg 
gggaatgaaa 
ttttgtatct 
tcatgggaaa 
tacttttttt 
ccttttgttg 



gtgccctcgg 
atatttgggt 
caaggatggg 
tcttacttgt 
cacgggacca 
gtttctgaga 
gccttcatcg 
acaggcgcca 
cagcttccaa 
ttaacagaac 
cactctgctg 
tatctgacca 
gagaatccta 
ccctagagta 
tttttttgag 
ctcactgcag 
ccgggactac 
ggggtttcac 
cggcctccca 
aaattgaaga 
ggtacttgtt 
gtcaagccac 
attcaaggga 
ttatacagta 
tcctccatga 
caagacccca 
acattacatc 
ctcagcgagg 
ccctcacatt 
ggggttgggc 
aatcaaggtg 
acccagacga 
taaattatta 
acatttttcc 
gaaaattctg 
tttttttttt 
cctggcagct 



gggtcgcgct 
tcttggtctg 
tgatgtatgt 
ttggatttta 
ctggcatcct 
aactgctgga 
ccacgctgct 
ggccaagggg 
cgtttgccat 
gtccttgcag 
aataggagga 
tgaacttgct 
gccccactaa 
ggtctagggg 
acggagtctc 
gctccgcctc 
aggcgcccac 
cgtattagcc 
aagtgctggg 
ggtgctgcca 
cagcctgtgc 
cctgcccacc 
aatctgcatg 
agacctgggg 
gtctgacatc 
cctcttaagg 
cgtggattct 
agtgaaggac 
ctcttaagca 
aagggctcac 
cttggtttta 
atagggcaca 
cgagaaaggg 
cctcctgggg 
gcttcactac 
ttttgcaacc 
gtctaggatg 



gttcctcacc 
gaccatggta 
ctcgctcacc 
caaaagattt 
gtacatgagc 
cccaagaatt 
ctacattctc 
gaaatgctct 
ctggatgaca 
actgagcgat 
agcttttctt 
cttctagaca 
tgccaagtct 
aatactcagt 
gctcttcctc 
ccgggttccc 
caccatgccc 
aggatggtct 
attacaggcg 
aggccttcag 
tggggagccg 
accctcccac 
aagccaattt 
cttgaggggt 
tcggaaactg 
actggttcct 
cctgccacaa 
ccatgtccca 
cccggtggcc 
caggacctca 
aagtaaatgt 
gccctgtaac 
acttgtccta 
tttgcgggga 
tgctcactgc 
tgataccttg 
atcactgatt 
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actatttact aagtagccac atgcaaataa aagttgtttg gtaaaatgga aaaaaaaaaa 



2280 



<210> 272 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> U17077 

<400> 272 t , cn 

tcaccaggac ctcaacgggc aaagttgtgc acactaaaat atcaaatcaa ggtgcttggt 6 0 

<210> 273 

<211> 2554 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> X87949 



<400> 273 
aggtcgacgc 
gagagggaag 
tgtgaccccg 
cctgtggctg 
gccgcgatgc 
gtgggcacgg 
aacggccgcg 
gccttcactc 
aaccccgaga 
tctgtgcagc 
tacattcaag 
gccatggttc 
catgcagttg 
gctggaacta 
attgcttatg 
ggcggaacct 
actaatggag 
atcaaactgt 
aaactccggc 
gaaattgagt 
gaagagctca 
gattctgatt 
attccaaaga 
ataaacccag 
gatcaagata 
actgtaggag 
tctcagatct 
ggtgaaagac 
cctcctgctc 
attcttcgag 
aatgaccaga 
tttgctgagg 
tatgcctatt 
tctgaagata 
caccaagatg 
caaccaatta 
acagcagaaa 
ggactcagga 
atctfccacct 



cggccaagac 
cgccgcggcc 
ggcccctgcc 
gactgcctgc 
tgctgctgct 
tggtcggcat 
tggagatcat 
ctgaagggga 
acacggtctt 
aggacatcaa 
ttgatattgg 
tcactaaaat 
ttactgtacc 
ttgctggcct 
gcctggataa 
tcgatgtgtc 
atactcatct 
ac aaaaagaa 
gcgaggtaga 
ccttctatga 
acatggatct 
tgaagaagtc 
ttcagcaact 
atgaagctgt 
caggtgacct 
gtgtcatgac 
tttctacagc 
ccctgacaaa 
ctcgtggggt 
tgacagctga 
atcgcctgac 
aagacaaaaa 
ctctaaagaa 
aggagaccat 
ctgacattga 
tcagcaaact 
aagatgagtt 
acttttgtta 
cagagtggag 



agcacagaca 
tgtatttcta 
gcctgcaagt 
tgctgcccaa 
cagcgcggcg 
cgacttgggg 
cgccaacgat 
acgtctgatt 
tgacgccaag 
gttcttgccg 
aggtgggcaa 
gaaagaaacc 
agcctatttt 
aaatgttatg 
gagggagggg 
tcttctcacc 



gggtggagaa 
gacgggcaaa 
aaaggccaag 
aggagaagac 
gttccggtct 
tgatattgat 
ggttaaagag 
agcgtatggt 
ggtactgctt 
caaactgatt 
ttctgataat 
agacaatcat 
cccacagatt 
agacaagggt 
acctgaagaa 
gctcaaggag 
tcagattgga 
ggaaaaagct 
agactfccaaa 
ctatggaagt 
gtagacactg 
ggaaaaaatt 
ttgaactgct 



gattgaccta 
gacctgccct 
cggaaattgc 
ctggctggca 
cgggccgagg 
accacctact 
cagggcaacc 
ggcgatgccg 
cggctcatcg 
ttcaaggtgg 
acaaagacat 
gctgaggctt 
aatgatgccc 
aggatcatca 
gagaagaaca 
attgacaatg 
gactttgacc 
gatgtcagga 
gccctgtctt 
ttttctgaga 
actatgaagc 
gaaattgttc 
fctcttcaatg 
gctgctgtcc 
catgtatgtc 
ccaagtaata 
caaccaactg 
cttctgggta 
gaagtcacct 
acagggaaca 
atcgaaagga 
cgcattgata 
gataaagaaa 
gtagaagaaa 
gctaagaaga 
gcaggccctc 
atctgctagt 
gaaagaactt 
atagcctaag 



ttggggtgtt 
tcgcctggtt 
gctgtgctcc 
agatgaagct 
aggaggacaa 
cctgcgtcgg 
gcatcacgcc 
ccaagaacca 
gccgcacgtg 
ttgaaaagaa 
ttgctcctga 
atttgggaaa 
aacgccaagc 
acgagcctac 
tcctggtgtt 
gtgtcttcga 
agcgtgtcat 
aggacaatag 
ctcagcatca 
ccctgactcg 
ccgtccagaa 
ttgttggtgg 
gcaaggaacc 
aggctggtgt 
cccttacact 
cagtggtgcc 
ttacaatcaa 
catttgatct 
ttgagataga 
aaaataagat 
tggttaatga 
ctagaaatga 
agctgggagg 
agattgaatg 
aggaactgga 
ccccaactgg 
gctgtaatat 
aagtctcgaa 
cggctgttta 



tcgcgagtgt 
cgtggcgcct 
tgtgctacgg 
ctccctggtg 



gaaggaggac 
cgtgttcaag 
gtcctatgtc 
gctcacctcc 
gaatgacccg 
aactaaacca 
agaaatttct 
gaaggttacc 
aaccaaagac 
ggcagctgct 
tgacctgggt 
agttgtggcc 
ggaacacttc 
agctgfcgcag 
agcaagaatt 
ggccaaattt 
agtgttggaa 
ctcgactcga 
atcccgtggc 
gctctctggt 
tggtattgaa 
taccaagaac 
ggtctatgaa 
gactggaatt 
tgtgaatggt 
cacaatcacc 
tgctgagaag 
gttggaaagc 
taaactttcc 
gctggaaagc 
agaaattgtt 
tgaagaggat 
tgtaaatact 
tgtaattgga 
ctgcttttca 
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ttagcagttg ctcacatgtc tttgggtggg gggggagaag aagaattggc catcttaaaa 2400 

agcgggtaaa aaacctgggt tagggtgtgt gttcaccttc aaaatgttct atttaacaac 2460 

tgggtcatgt gcatctggtg taggaagttt tttctaccat aagtgacacc aataaatgtt 2520 
tgttatttac actggtcaaa aaaaaaaaaa aaaa 2 554 

<210> 274 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> X87949 
<400> 274 

aactttcctc tgaagataag gagaccatgg aaaaagctgt agaagaaaag attgaatggc 

<210> 275 
<211> 1359 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> Contigl632 



60 



60 



<400> 275 

ttttaagaca gttacctgtt gtgctgctgt tacaatatat aatgaaacca agtcagggga 
gtgaatttat caatcttttg atgtaaagta aaaacgtagt tcacacttca ggagagaact 120 
tcatagcaca atgtctttct ataagatatt tttaatgatt tagtatttta caacatttgt 180 
ttaccatatt ttgatatacc atttttttct atctgcccag ttttattaaa aaaactatat 240 
attattttct aaagaaacaa tcatattttt atacaaaatt atgttttcag gtaacgaaat 3 00 
agatgtaggg tacagtggaa cataagcagt gttacccctg gctgggagtc agtattatac 3 60 
aacaaatggt gagctggaac atgccctgtc tgtgctgtcc ctcctgtgct gggtcgcgga 420 
tgtgtaggca acattgcctt atcacgctag gttcacctga cactttaaaa ggaaaaaaag 480 
ttccatagag ttctgtggtc acaaaattgt tttgctttta tcaaatactt taatagaacc 540 
aaagttgcag atattggaat gtatggaagt atctcagtct ctgcataaga ggattaaagt 
atgaaaggat catttaatga ctgttttact tataagtcat taagtaatcc accatttctt 
atggatgatg cttaagcctg gtgaggtttg tactctaagg agcccagatc ataatgcagt 
gcatttcctt agcccttaga gtttcttgca aacatttaaa aaaagacata tttaagaaag 
aaagataaag aaaaaacata tttaattact gtaaacaggt actgctttat gtttattttc 
tctctacttc aaccaaaatc agatctttga ggttttgctg acattgttgg tggttttgca 
catgttcttt ctaattggat ttatgaatag ttctatgggt tttcaaagat gaatcatgct 
aagaacactt ctgctttttg atccactgtt tgcagcagaa ttatatatat gtataggaaa 
aatccacttt gaataatcca tgttttgtat ttggaaattg tttttaaaaa taaaaaggaa 
aggaaatata taaagctgtt atttattctg catttcttac atatctatcg cttgtcagta 
tacccgtttt ggtatatatt gcctctgcac atctacattt gtatatgcaa cagtgagctt 12 0 0 
tatatctaca taaactgtaa ataatccttt ctgtgaaagg atcatcatat caagatgata 12 6 0 
ccaaaagtat gtaaaaagaa acctgcatta ttttgtaatt atttcttata gatatttcat 132 0 
ggtaagatta gcagtcaata aagttacttt tttgccttt 13 59 

<210> 276 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> Contigl632 

<400> 276 cn 
gggttttcaa agatgaatca tgctaagaac acttctgctt tttgatccac tgtttgcagc 60 

<210> 277 



600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 
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<211> 994 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> Contig3464 



<400> 277 

tgaatgtata 

actttcaatg 

tgtctgttat 

gtacttttta 

gttactgtaa 

tgacgtgatc 

gaataagctt 

caaatggcgg 

aacttgttct 

gccttttcaa 

tgaaagcttt 

ctgactaaag 

ctttttttct 

agtcaagaca 

ctgcactttt 

aactgtatca 

ggttcaataa 



tattaagact 

ttagacattt 

agctctttaa 

ttcaatgtaa 

tttttttttt 

tttaagactg 

ttatcaaata 

tcttatagtt 

ggcaaaccag 

ctcttaatgc 

tgtatgtatt 

ctgacctaaa 

ttggagggag 

aacacatctg 

tatgacattt 

catttatact 

aattggcaaa 



gtagctgaat 

atccttaagt 

ttcatcagta 

tcagaagctg 

tttttttacg 

ctatatatat 

gataattgat 

tcaattctga 

accctgctgt 

cctccacatg 

actaggtttt 

tggatcagtt 

ggtaacatta 

aaataaagga 

tgaaaatcta 

tgcagaaatt 

actgaacacc 



tgcacatgaa 

tgtgagcgat 

ttaatacaga 

tgatgttttg 

aagcacgtga 

cagtctctta 

gcaatttagg 

aaatagcaaa 

agatatagtc 

aatcagcagt 

gtttttctta 

tatgtgtaat 

tttggacaga 

actgtgtatt 

tttataggta 

tatttcattg 

aaaa 994 



atcagattgc 

atatgtagca 

attatcattt 

cctttgtagt 

ctggactaat 

ctctataagg 

attcacgcaa 

cttaataaac 

taaggtagtt 

taagaaggtt 

tgtttgctga 

attctagtgc 

tgcagaagga 

aacatgttaa 

cagaacaatg 

ttattagtag 



caacttcttg 
tgctgtgaaa 
gcgtttcttg 
cctgtgcttt 
gtaaggcaga 
ttttaaatta 
gtttcagtgt 
agccacttta 
aaccatataa 
ctagaaccca 
ttttacagtt 
tttaatgact 
actgttagtg 
caattcataa 
ggttttgtta 
gaattttatt 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 



<210> 278 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> Contig3464 

<400> 278 _ ^ n 

ctgctgtaga tatagtctaa ggtagttaac catataagcc ttttcaactc ttaatgccct 6 0 

<210> 279 
<211> 423 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> Contigl4683 



<400> 279 

tatgttatgg 

agtaatactc 

agtttttcct 

aggtatttga 

cagctgtttg 

ttttgtgttt 

atagaataat 



atatcttatt 
tactaatcaa 
taatctagtt 
gggtgttcca 
aaagctctgt 
tgtttctaaa 
gagtccagga 



ttagagtaag 
tacttagaag 
atgttggtaa 
tttccactga 
atttggctat 
ggaagacagt 
ttaatttggg 



aatataaggc 
tttttgttat 
tttataagtc 
atagtaagat 
aacggaaaac 
gttttgttcc 
ataaagtctt 



atagccatat 
gactaatctg 
agttttcaga 
gatgcttact 
tttgttaggg 
ttctttagaa 
ttacttcata 



ttatgaaggt 60 

aatgcttttt 12 0 

ttaggaaaga 180 

tagatttcca 240 

atgcttgatg 3 00 

aacttgaaga 3 60 

aattctgatt 42 0 



ctg 423 



<210> 280 

<211> 60 

<212> DNA 

<213> Homo sapiens 



131 



WO 2005/086891 



PCT/US2005/007894 



<300> 

<308> Contigl4683 

aggaagacag tgttttgttc cttctttaga aaacttgaag aatagaataa tgagtccagg 6 0 

<210> 281 
<211> 391 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> Contig28552 



atgccattga tgtgaagaag gtgtctgtgg aagactttct tactgacctg aataacttca 

gaaccacatt catgcaagca ataaaggaga atatcaaaaa aagagaagca saggaaaaag 120 

aaaaacgtgt cagaatagct aaagaattag cagagcgaga aagactcgaa cgccaacaa 180 

agaaaaagcg tttattagaa atgaagactg agggtgatga gacaggagtg atggataatc 240 

tgctggaggc cttgcagtcc ggggctgcct tccgcgacag aagaaaaagg acaccgatgc 300 

caaaagaSt tcggcagagt ctcagtccaa tgtctcagag gcctgttctg aaagtttgta 360 
accatggtaa taaaccgtat ttataaattg c 391 

<210> 282 
<211> 60 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> Contig28552 



60 



aagactttct tactgacctg aataacttca gaaccacatt catgcaagca ataaaggaga 



60 



<210> 283 
<211> 450 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> Contig28947 



ctcatccaag gagctggggc agacttcatt gattctagag agacctgttt cagtgcctac 
tcatccctgc cctctggtgc cagcctcctt accatcacgg cttcactgag gtgtaggtgg 
gtttttctS aacaggagac agtctctccc ctcttacctc aacttcttgg ^gggaatc 
agtgatactg gagatggcta gttgctgtgt tacgggtttg agttacattt ggctataaaa 
cKcttgt? gggaaaaatg tgggggagag gacttcttcc tacacgcgca ^tgagacaga 
ttccaactgg ttaatgatat tgtttgtaag aaagagattc tgttggttga ctgcctaaag 
agaaaggtgg gatggccttc agattatacc agcttagcta gcattactaa ccaactgatg 
gaagctctga aaataaaaga tcttgaaccc 450 



<210> 284 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> Contig28947 
<400> 284 
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agacagattc caactggtta atgatattgt ttgtaagaaa gagattctgt tggttgactg 



60 



<210> 285 
<211> 439 
<212> DNA 
<213> Homo sapiens 



<300> 

<308> Contig30875 



60 

120 

180 



<400> 285 

agaaatcaat gacagttgac aggaagagag gacgcataca acaggcaaaa gaggaatgcc 
cagcagtctt ggtccttgcg gtgcaatact ggccttgagg ccaagtcagc aggggattcg 
tagtcactaa cttctaactg aggcagggaa gtaccatgtt ctggaaaagg tccaaagaaa 
caggaataga ggcagtgtag caagaggcag atttttggtg ccaaatagat ttgaatcctg 240 
gttctgcttc ttcctttgta gagtatgata ttggttcttt cctcccaaag ctattataaa 3 00 
gactaaatat gtacacaaat ctttgggatg tctgacatat aaatgcttaa caataggtat 360 
ttgctggtat tattacaaat gaatttgctt atttttgagc cacttctatg tctgtccatt 420 
aaaccaaaat gtgttctgc 439 



<210> 286 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> Contig30875 



ggttctttcc tcccaaagct attataaaga ctaaatatgt acacaaatct ttgggatgtc 



60 



<210> 287 
<211> 338 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> Contig31221 



<400> 287 ... cn 

gggaagttac actgcttcac accacaaggc cgtgggaaat cttggaggtt ctgtgccttt 6 0 

ctgtcacctc tactttttgc agctgtgatt gcactgtccc gcacatgtga ctacaagcat 120 

cactggcaag gaccctttaa atggtgaaaa tgggcagatg aatagcaata agtggacctt 180 

tgttactctt ctgagttaga aaaattctaa tttagtacac tctgaacaaa gcttattata 240 

cttacttaag atgtgttttg atttggtgtt cagaaagcaa cctgacaatg ataatactgt 300 
aactatgata aaattgagaa taaaaagatt ttatttag 33 8 



<210> 288 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> Contig31221 



<400> 288 

aaatgggcag atgaatagca ataagtggac ctttgttact cttctgagtt agaaaaattc 



60 



<210> 289 
<211> 417 
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<212> DNA 

<213> Homo sapiens 

<300> 

<308> Contig31288 



<400> 289 

gaatcacttg agcccgggag gttgaggctg 
agcctgctgg gtaacagagc aagactccat 
cataatgagg ccaggcaaaa aaaaaaaaag 
caaagctgct actgccattg taccagtgtt 

atttttatga cagattttat attgtaacca 
ttaaactacg atttatcata tgctcccagt 
aatgtaaaca accaaggtgc atctggttat 



cagtgagctg tgtttatacc actgcactcc 60 

ctcaaaaaga aaagaaaaaa tgctttgcta 12 0 

tcctgtggaa atcatataga caaacatttg 180 

aaaatgtgtt ctaccttgca tcttttactg 240 

tttgagaact ctgtaagtgc tatggcttcc 3 00 

gtttactttg agactgaatg gcaaccagag 36 0 
gttttaaaat aaagattaat aaaagtt 417 



<210> 290 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> Contig31288 



ggcttcctta aactacgatt tatcatatgc tcccagtgtt tactttgaga ctgaatggca 60 

<210> 291 
<211> 394 
<212> DNA 
<213> Homo sapiens 



<300> 

<308> Contig31646 



<210> 292 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> Contig31646 



gccagaccag aattacacag ttcaaattat tgcatacaat aaagataaag aaagcaagcc 



60 



gctgctacac cccatgtaaa aagcggaaaa taaaatgaag attttccagc gcaagatgcg 
gtactggttg cttccacctt ttttggcaat tgtttatttc tgcaccattg tccaaggtca 120 
agtggctcca cccacaaggt taagatataa tgtaatatct catgacagta tacagatttc 180 
atggaaggct ccaagaggga aatttggtgg ttacaaactt cttgtgactc caacttcagg ,40 
tggaaaaact aaccagctga atctgcagaa cactgcaact aaagcaatta ttcaaggcct 
tatgccagac cagaattaca cagttcaaat tattgcatac aataaagata aagaaagcaa 
gccagctcaa ggccaattca gaattaaaga ttta 394 



300 
360 



60 



<210> 293 
<211> 357 
<212> DNA 
<213> Homo sapiens 



<300> 
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<308> Contig37562 



<400> 293 .... an 

caattatttc aagtgcacct tattaacaaa agtatcagtg gatccaacat aaaattttat bu 

agtactaaat gtcaagccta actgtgaatt ttgttctgta tcttaagtaa atttatgata 12 0 

atgttctcga gctatcaaca aaatatatgt acttttgtga gctatgaatt ttctaattaa 180 

attttacatg ctataacatg atttttacat gaatgatact ttgtttataa ctatcaaatg 240 

tcagtatttt actacaattt tattataaag tgtacattat cactaaatga acttcgattt 300 
taaaaatcaa attagcttta gttgtatatt attttttaca aataaagata gacttgt 3 57 

<210> 294 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> Contig37562 

<400> 294 cn 
atcaaatgtc agtattttac tacaatttta ttataaagtg tacattatca ctaaatgaac 6 0 

<210> 295 

<211> 351 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> Contig37895 

<400> 295 cn 

aatagagaca cctctaatta attaaagcgg atgccctccc cactcctccc aggatttgac 6 0 

tcggagcaca aactcttcac aaaccaaaat gtcaggacac catcgccagt gtccactggc 120 

cactgctgtt ggtgtgaggc agccaggagc ccctcagaac tagtaagtct gagaagaggc 180 

tgcacggggc ctaggagagg gagaaatgag cccgtccaag gtgaattcct tgattctcca 240 

ttgtgagtgc accaagaaca agcactccct ccgactgact ctcgcctacc aggatctgga 3 00 
acaccttcca ttaatttatt cgttcattca ataaatattt attgactgac t 351 

<210> 296 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> Contig37895 

ctctcgccta ccaggatctg gaacaccttc cattaattta ttcgttcatt caataaatat 60 

<210> 297 

<211> 418 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> Contig38288 

60 



<400> 297 

gacaagtaaa tgggggccgt tgggacggcg ggtgcctgga gggcagctct gggctcagcg 
ggcagtgctt agagcacagg cccctctgtt gggggatggg gaggagagca gtctgccctt l^u 
gggagcgtag gccccaggga gacttctaaa gccccccctg tcgtctgctc ttcacccagc 180 
accacagagg cacctgctgc acacacaagc atctcactcg gcccacggag ggggccaggc 240 
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ttcctttgcc tgaagctgtt ttgggaaggg tctccacaca ggcactgatc tcccaagctt 3 00 
tggtcatgat gtcttttacc atttgataat tttaaacatt gtttttaaac ccaaaacatt 3 60 
tagtggtccg ttgcctctga agatgtaaac aaacaaatac actatttctg ggaacatt ' 



418 



<210> 298 
<2ll> 60 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> Contig38288 

tttagtggtc cgttgcctct gaagatgtaa acaaacaaat acactatttc tgggaacatt 60 

<210> 299 
<211> 413 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> Contig38901 



60 
120 



<400> 299 

tacatttttg tttaatgttg ggcctgaggt taactgtgac catggtccag cttgagtggc 
ttctggagca gccacatttt caaggactgt ccaaaagcca gccagttcag ggctcaggcc 
tcacccattg cccactcctg gggagaccat cacctggctc atcgtttcca ccaagagtgc 180 
cccacaggag tgccccacag acccgctgga ccagcctgct gcgggtcctg gccaggggtc 240 
tggctaacgg tgagggctga ctctgaactg tctctcagtc tccagaaagt gttcaagcct 300 
gttgtgttcc caaatctgat tcctcctatt gtcttgtaaa tcaaactcta agtgaaaact 360 
tcccatttgt cccttcaaag attttttttt attaaatggt tttttaagat cct 413 



60 



<210> 300 
<211> 60 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> Contig38901 
<400> 300 

tgttcccaaa tctgattcct cctattgtct tgtaaatcaa actctaagtg aaaacttccc 

<210> 301 
<211> 434 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> Contig40434 

Jaatggtgaa agagagatgc cgtgttttga aagtaagatg atgaaatgaa tttttaattc 60 
aagaaacatt cagaaacata ggaattaaaa cttagagaaa tgatctaatt tccctgttca 120 
cacaaacttt acactttaat ctgatgattg gatattttat tttagtgaaa catcatcttg 180 
ttagctaact ttaaaaaatg gatgtagaat gattaaaggt tggtatgatt tttttttaat 240 
gtatcagttt gaacctagaa tattgaatta aaatgctgtc tcagtatttt aaaagcaaaa 300 
aaggaatgga ggaaaattgc atcttagacc atttttatat gcagtgtaca atttgctggg 360 
ctagaaatga gataaagatt atttattttt gttcatatct tgtacttttc tattaaaatc 42 0 
attttatgaa atcc 43 4 
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<210> 302 

<2H> 60 

<212> DNA 

<213> Homo sapiens 



<300> 

<308> Contig40434 



aaggaatgga ggaaaattgc atcttagacc atttttatat gcagtgtaca atttgctggg 60 

<210> 303 
<211> 391 
<212> DNA 
<213> Homo sapiens 



<300> 

<308> Contig40552 



<400> 303 

caccaagccc tgctccggca cctcgaatcc 
ttactgtgcc cccagaccaa gagctccatc 
cttgcagctc cagagccccc tggcccagcc 
gctggctctg agtgtccctt ccctagaaag 
gttttgatgc cctcactagt gttgaagcct 
ggatggtcag ccctacccac ctgccctgtt 
gattttttgt tcaataaaga atttggtaaa 

<210> 304 

<211> 60 

<212> DNA 

<213> Homo sapiens 



ctggcgacca tgagt caeca gctccaagcc 60 

ccccgccctc tgcagcgttt gtctagcgcc 120 

cgtgactcct etttggggee tacagatgaa 180 

gcctgaccct ccttacccac cagaacaggg 240 

gttccagaga gaggtgggac tgcaaggaga 300 

tgagcttcct gtttgacaat gtttgctgtt 360 
a 391 



<300> 

<308> Contig40552 
<400> 304 

tttgagcttc ctgtttgaca atgtttgctg 



ttgatttttt gttcaataaa gaatttggta 6 0 



<210> 305 
<211> 495 
<212> DNA 

<213> Homo sapiens 



<300> 

<308> Contig41413 



<400> 305 

aaatattctt aatagggcta ctttgaatta 
gagacattgc atgaaagatg atgagagata 
tgtattcagt atttgaacgt cgtcctgttt 
agacaatttt taaatctctg taatatgata 
taaagttaat ccagattata tggtccttat 
tettgeattg tgaggtacag gcggaagttg 
agctgaataa tgtgaggatt aacttctgcc 
ggaaacaagt atttcagtct caaaattgaa 
aactgttctt atgtc 495 



atetgecttt atgtttggga gaagaaagct 60 

aatgttgatc ttttggcccc atttgttaat 12 0 

gttgttagtt ttcttcatca tttattgtat 180 

cattttccta tcttttaagt tattgttacc 240 

atgtgtacaa catfcaaaatg aaaggctttg 3 00 

gaatcaggtt ttaggattct gtctctcatt 3 60 

agctcagacc atttcctaat cagttgaaag 42 0 

taatgcacaa gtcttaagtg attaaaataa 480 



<210> 306 
<211> 60 
<212> DNA 
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<213> Homo sapiens 
<300> 

<308> Contig41413 
<400> 306 

cagctcagac catttcctaa tcagttgaaa gggaaacaag tatttcagtc tcaaaattga 

<210> 307 
<211> 409 
<212> DNA 
<213> Homo sapiens 



<300> 

<308> Contig41538 



<400> 307 

aaaaaaaaaa aaaaaaaaaa aaagagttgt 
tacatagtcc gaaggtctta caactaatca 
gatgctgatt agttctcagt tttcattcag 
ctgccaaggg tcataaatgg tgactgccta 
catccagact tctggaactc aaagattaac 
cacttatagc ttcaggcatg tatttatatg 
gtattcctga tagtaatgct acaataaatc 

<210> 308 

<211> 60 

<212> DNA 

<213> Homo sapiens 



tttctcatgt tcattatagt tcattacagt 60 

ctggtagcaa taaatgcttc aggcccacat 120 

ttcacaatat aaccaccatt cctgccctcc 180 

acaacaaaat ttgcagtctc atctcatttt 240 

ttttgactaa ccctggaata tctcttatct 3 00 

tattcttgat agcaatacca taatcaatgt 3 60 
caaacatttc aactctgtt 409 



<300> 

<308> Contig41538 



ctcatgttca ttatagttca ttacagttac atagtccgaa ggtcttacaa ctaatcactg 60 

<210> 309 
<211> 552 
<212> DNA 

<213> Homo sapiens 



<300> 

<308> Contig41887 



<400> 309 

ctgaagacta cgaccatgaa atcacagggc 
aaagtgtcca ggtgaaactt ggagactcct 
atacccagga agtcaccctg cagccaggcg 
aagctttcct ccggggtatg gtcatgtaca 
agcttgatgg ccagatctcc tctgcctacc 
tctatggcca gtatcaactc cttggcatca 
tagaggagcc gaccactgag ccaccagtta 
gtcgctaggg tggggtatgg ggccatccga 
ggtactggag taactgagtc gggacgctga 
cagaatcagt gc 552 



tgcgggtgtc tgtaggtctt ctcctggtga 60 

gggacgtgaa actgggagcc ttaggtggga 12 0 

aatacatcac aaaagtcttt gtcgccttcc 180 

ccagcaagga ccgctatttc tattttggga 240 

ccagccaaga ggggcaggtg ctggtgggca 3 00 

agagcattgg ctttgaatgg aattatccac 360 

atctcacata ctcagcaaac tcacccgtgg 420 

gctgaggcca tctgtgtggt ggtggctgat 480 

atctgaatcc accaataaat aaagcttctg 540 



<210> 310 

<211> 60 

<212> DNA 

<213> Homo sapiens 
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<300> 

<308> Contig41887 

tactjgagta actgagtcgg gacgctgaat ctgaatccac caataaataa agcttctgca 60 

<210> 311 
<211> 745 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> Contig42342 



<400> 311 



60 
120 



gcagtaaaga caggacgcac ccatgtcaca agaggagcac aggcaggggt gttggtgttg 
gggcagccct cagggtctcc agacccagcc ccactcacac agcagcctag gaaggaaggg 
cagagtccca ggtgtcagct ggtgggtctc ccaggagctg cccctccctg gaagtcacag 180 
gacaggaatg acagatcagg gaactgcagg aagctgccac ctctggggtc agaatatgcc 2 40 
cagcctgcgg gggctctcta tcggggtctt cgagagccag acagcctgcc ttgtgctgca 300 
tacctggctt tgctctgtgc agaacccagc acacgtgafct ttgtgtgaca tgccagcagc 3 60 
ctggctccca ggacaggagg cctgccctgg gggaggggct gcaggaggag ggggggcagg 
cacccatgag tctgtccagc cttgtcacag atgcatcgcc caagctgcgg tcctgatttc 
agctcacctc agagtaaatc agaataaact gcacccagac tttcacgaat gcatgttgac 
gctttcagtt cacccctttc tttgctaact ttcttcctat tttcttctaa tgcgagagct 
tattaattcc atatttatca ttttgaataa cttttctcct ttttagtaac aaaatgtact 
tcactcttag taaaatgtat ttactatttt agtaacaaaa atatacttgc ctaatcatgt 
ttaaaatata gtgatgtgaa aaatt 745 



420 
480 
540 
600 
660 
720 



<210> 312 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> Contig42342 



cacccagact ttcacgaatg catgttgacg ctttcagttc acccctttct ttgctaactt 

<210> 313 
<211> 398 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> Contig43645 
<400> 313 

agttcaaagg cagataaatc tgtaaattat tttatcctat ctaccatttc ttaagaagac 
attactccaa aataattaaa tttaaggctt tatcaggtct gcatatagaa tcttaaattc 
taataaagtt tcatgttaat gtcataggat ttttaaaaga gctataggta atttctgtat 
aatatgtgta tattaaaatg taattgattt cagttgaaag tattttaaag ctgataaata 
gcattagggt tctttgcaat gtggtatcta gctgtattat tggttttatt tactttaaac 
attttgaaaa gcttatactg gcagcctaga aaaacaaaca attaatgtat ctttatgtcc 
ctggcacatg aataaacttt gctgtggttt actaatct 398 

<210> 314 
<211> 60 
<212> DNA 



60 



60 

120 

180 

240 

300 

360 
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<213> Homo sapiens 
<300> 

<308> Contig43645 



Jaaaagctta tactggcagc ctagaaaaac aaacaattaa tgtatcttta tgtccctggc 60 



60 
120 



<210> 315 
<211> 478 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> Contig44289 

ctaaaaacaa cactcatcag tcttgggaaa tttgaacttt gatcaactta actaaagaag 
gaagggtagt aagaattttt caaatacaaa tatttgccaa ttcacagatg ataacattta 

aggStcaa aagtaagggt ttttccttgt ttctccagtc agcttttgtc aac c aa a 180 

gttttttcat aaacattttt tatttgtata attgcaacag tttaagaaat tatcacaact 240 

atttagaaac atttaaaatg ttctttttga tataagctat atacttggaa aaatacattg 3 00 

gtatctaaaa tttgaggtgt gttaagactg ctttttgttt taaaaaatgg tttacattca 360 

aatttttgaa gtgttttatg cttcatatgg ctaagttgta gtttggcaga gttaacagca 420 
taagaataaa catgctgtaa ttttaaaaga tgctttgaat aaaaatttat tttaattt 47 8 

<210> 316 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> Contig44289 



60 



catcagtctt gggaaatttg aactttgatc aacttaacta aagaaggaag ggtagtaaga 

<210> 317 
<211> 556 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> Contig44909 

accatctggg atttctacag cctgggtacc catagccaca ccaaggcttc tgggagattc 
tgcagggtca gctttccagg ctgttcccaa atagctccct gcctccccac tgcccctaaa 
gccacagcag aagagccatt catctcataa acaaaaagga agaggaaaga atgaggaagg 
accctgtgca aggttatttg caggcaggga tgggcttgta cctgacagca cccacccctg 
tgtggccccc aggccctcat caccctcaga cccctcctaa gcagttccct cattgctctt 
tggactaggc tgacagcagg aagagcaggg cccatgaccg ggtggaagtt cagttttggt 
gtctgcttca agagggggtt ttacactctg attccaggac aagcactctg aggcgggtgg 420 
gggagagaaa ccctggctct tcacccaggt ttcacacaca tgtaaatgaa acactatgtt 480 
agtatctaac acactcctgg atacagaaca caagtcttgg cacatatgtg atggaaataa 540 
agtgttttgc aatctt 556 

<210> 318 
<211> 60 
<212> DNA 



60 

120 

180 

240 

300 

360 
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60 



540 
600 
660 



<213> Homo sapiens 
<300> 

<308> Contig44909 
<400> 318 

tcacccaggt ttcacacaca tgtaaatgaa acactatgtt agtatctaac acactcctgg 

<210> 319 
<211> 710 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> Contig45032 
<400> 319 

aaagataggc ttctaagtta aggcaaatca ttcattctgt cattaaacaa atacaaacca 60 
ggcacctgtc atatgccaag tgatattcaa aatggcccat gtagaccttt gtgaagtatg 12 0 
tggcctaaca gacattaaac aaatgtctgt gaaactgaca taataaagta aggtaagtta 180 
tatgtgagac attctctttt tataataatt cctgtaaagc agtacttact taggtaatga 240 
tatcatactg ttttgtttta tatttttcct aagagctaaa acgtcatcct ctcttcagtg 3 00 
atgtggactg ggaaaatctg cagcatcaga ctatgccttt catcccccag ccagatgatg 3 60 
aaacagatac ctcctatttt gaagccagga atactgctca gcacctgacc gtatctggat 42 0 
ttagtctgta gcacaaaaat tttcctttta gtctagcctc gtgttataga atgaacttgc 4 80 
ataattatat actccttaat actagattga tctaaggggg aaagatcatt atttaaccta 
gttcaatgtg cttttaatgt acgttacagc tttcacagag ttaaaaggct gaaaggaata 
tagtcagtaa tttatcttaa cctcaaaact gtatataaat cttcaaagct tttttcatct 
atttattttg tttattgcac tttatgaaaa ctgaagcatc aataaaatta 710 

<210> 320 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> Contig45032 
<400> 320 

ttaacctagt tcaatgtgct tttaatgtac gttacagctt tcacagagtt aaaaggctga 60 

<210> 321 

<211> 726 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> Contig46218 
<400> 321 

atacatattg ctttagagag caggtaggtg gccatgtgtt cagcagtgtg tccttaagaa 6 0 

aataccatct ttctaagcca ctggaatttt tactttacta tttttaacat taatggatgt 12 0 

caggtcatca acctcaagtc tttacatatc catgtatatt ccatatatat tgtttatata 180 

ggcccaagtt tctccttaat tgggatctat atactaccag cacaacatca aaaacatgta 240 

attgaataca tcagagctat atatgtaagg aaatgactgg tgaccccatt atcatcattg 3 00 

ttgaattcat gttaagtaga ccctctaggg gaccataagg caattgagca cataacgaaa 3 60 

aatgatgcaa taagaatgta tgcactctct ttgccaaatg catgtgcttt tgtgtaacgt 420 

ggatgtaaac agaattgcag tgctgccgaa attcttgatc ttggctaaga gagtattttt 480 

ccccttgtaa ttatgactct gagataaaat tgccattttg aaatttccaa agtaacaact 540 

ttttttattt tatgaataaa cttgggattg caatttctct gatctgacaa tcaataactt 600 
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taacaaagat ctaaataagt gtttcaagga aagttttcct aagcaaatgt aatattacct 660 
catttgggca tcattactct gttaattcta tatcaaagga aataaacttg ctacttgcac 72 0 
taaatg 72 6 

<210> 322 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> Contig46218 
<400> 322 

accataaggc aattgagcac ataacgaaaa atgatgcaat aagaatgtat gcactctctt 60 

<210> 323 
<211> 580 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> Contig47096 
<400> 323 

ggtggtctct catccttgtg tgctgctctg ctaagagatg tccaaggcgg agccggggca 60 

agatccttcc agactcatct gtcagagccc caagcccttt agacccagag cccaaggacc 12 0 

atgcctttgg gacattagga ctgcagcctt tgcttctgtg tattttggag ttttggtgac 180 

ttttgtcacc tggacacact catttgttag ccatagtggg ttcccttggt cagcaacagt 240 

gcatgtacct ctggatgtca tctgaggtga gaccaccgag gccttttctc tctgtgtaca 300 

gaggggagtt aggagttgct ggactggatg cattacgagg actggggaca gggtagaggg 360 

acatccaggg atcagggcat gagtgggggc aaccccccgg cctctgccct ggcatggtct 420 

ccgcatgggc tgaggtgtag ctgattggct gccacatttc ggccatgctg gctggcgtgc 480 

ccatgttgca gatattttcc cgagttcccc agaatggatg gtattgaatc tcagccacat 540 

gcaacactgt gtccagcatt ctttgcaata aatacttttt 5 80 

<210> 324 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> Contig47096 
<400> 324 

atattttccc gagttcccca gaatggatgg tattgaatct cagccacatg caacactgtg 60 

<210> 325 
<211> 632 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> Contig47563 
<400> 325 

gccatctagt ctgtggtttt ctgttgaagc agtctgaatt gactaaaaca gtcacttgga 60 

gtagttataa accactttcc tgttgaaagc agaacatgct gattcaactg ttttgttcaa 120 

tagcaatgat agattttgtt taagtcccct acactttctt atttctaaat gatcaagagt 180 

acacttcctg gcagtgatta aggagtgtgt atctaacaga aaaaatatat ataccctgtg 240 

aacccgaata tggaattcag attgtttctg ccctcagtat catacttaaa aaacaagcat 3 00 

acaaacaaac ataagggaac aaacagcaac cataacaaaa acaaaccttt aaaggtgggt 360 
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ttttgctgtg ataaatgaat acggtactct 
aaactgcaag tgatttaaaa attagagaat 
cagaaaacct caagtgaatt ttgtatgtaa 
tttaagcaaa caattagctg aaaacttggt 
gcacccatca gaaaataaag ggcattaaat 

<210> 326 

<211> 60 

<212> DNA 

<213> Homo sapiens 



gaaggagaaa aaagtttctc aaatgagctt 42 0 

ataattctta aagctattga aagtttcaac 480 

atgaaatctt gaatgtaagt tctgtgattc 540 

attgttgtag tttatgtagt aagtgacttg 600 
tg 632 



<300> 

<308> Contig47563 
<400> 326 

agcaaacaat tagctgaaaa cttggtattg ttgtagttta tgtagtaagt gacttggcac 60 

<210> 327 
<211> 540 
<212> DNA 
<213> Homo sapiens 



<300> 

<308> Contig48913 



<400> 327 

accagagggt gtcccttttc cacagtaatg 
agggaggtgg tcaagcttga aaaactggct 
ccaagtgttc tcaacctcca ttctgcagtg 
cccagcatcc aggtgttgtg tggcttagcg 
tggaagcagc tttctggttc ttgtgattgt 
cgaggatcct cagtgaccat ggtggccaca 
gctgaagaac cttgcctgtg gcacttttcg 
cccgtgctgg gactcatgcg ggtcatttcc 
tatgagcacc atgtaagcct ccttgtattg 

<210> 328 

<211> 60 

<212> DNA 

<213> Homo sapiens 



ggatcggctg gtgtgccttc agggaggaag 60 

ttaggatggt tctgactttg ttctccctcc 12 0 

ttcagagttt tagggaaagg gtttgggtgc 180 

catgtgaagt gaaaaccttc tggggttgtt 240 

atcctgaggt cccagaaccc tattctccca 3 00 

cgcctggcca gcctgctggc tcctgggtga 360 

agggtgagct ggaaccgaga gaacatggtc 42 0 

tgccggcctg gtttcgcctg gtcgtgtctt 480 

agataattgg gcattaaaca ttaaactgca 540 



<300> 

<308> Contig48913 



<400> 328 

tatgagcacc atgtaagcct ccttgtattg agataattgg gcattaaaca ttaaactgca 6 0 

<210> 329 
<211> 534 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> Contig49169 
<400> 329 

cctaatgtta acatttttaa aaatacatat ttgggactct tattatcaag gttctaccta 60 
tgttaattta caattcatgt ttcaagacat ttgccaaatg tattaccgat gcctctgaaa 12 
agggggtcac tgggtctcat agactgatat gaagtcgaca tatttatagt gcttagagac 18 
caaactaatg gaaggcagac tatttacagc ttagtatatg tgtacttaag tctatgtgaa 24 
cagagaaatg cctcccgtag tgtttgaaag cgttaagctg ataatgtaat taacaactgc 3 0 
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tgagagatca aagattcaac ttgccataca 
caaatctaca gttctgtttt tgctactcta 
tgtatttgag acaaataggt gatactgaat 
cattggcacc tcaatgataa agaaatttaa 



cctcaaattc ggagaaacag ttaatttggg 3 60 

ttgtcattcc tgtttaatac tcactgtact 42 0 

tttatactgt tttctacttt tccattaaaa 480 
ggtataaaat taaatgtaaa aatt 534 



<210> 330 
<211> 60 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> Contig49169 
<400> 330 

catacacctc aaattcggag aaacagttaa tttgggcaaa tctacagttc tgtttttgct 60 

<210> 331 
<211> 602 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> Contig49388 
<400> 331 

tgtcagtgga ggggtctctg cagccaactg agactatctt gctgtgccct gagccttcct 
agggtttaga agaacagcat tcaaaattcc ccgtcctgtc agtgtttgcc ttcgcacctc 
ctcccctaaa gcagcgcggg gggcaaataa gaccccaccc ctccctgcag cttcacaggg 
acgcttcctt ccctccccgc aaccacccca ggctcccctg ggaggctgca gttgtggtac 
acgtccccgg tgctgggttg gccgtgactc gggggcgggg cgatcgggtc tcagcccctg 
ccttccccag tctctgggtc acccgaattt tcccacccct gcttctcccc gaggaggttg 
agctcttgag caagttggga cttgggccgg ggcctggaag aatgattggc tgggaggccg 
cgggagggag gccaggaggc ccggaccagt tgggaggagt gagcaggccc cgggggaggg 
ggatgagcgc agtttgctcg ctttcctccc ctgccggccc cctccgcccc cacacacact 
cgggacgtct tcattgaaga ttcacttaca aaggaatgtt tcactaaata aaagaaaacc 
ag 602 

<210> 332 
<211> 60 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> Contig49388 
<400> 332 

cgggacgtct tcattgaaga ttcacttaca aaggaatgtt tcactaaata aaagaaaacc 6 0 

<210> 333 
<211> 562 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> Contig50728 
<400> 333 

gcgaatttgg gccccttgat cctctgatgg gagctgaaag gatgagaggt gggcatctag 60 
atttagggag gctgttcagg ctttgcaggt cccttacctg aacacataga aaccctggag 120 
ctgtgactgt gtccatgtgt gtgtgtttgt ctgtgtgtgt tgcgggggat gggcacctgc 180 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 
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atgaatgtgg tagagaaaat ggctctgctc agagggaaga tacgcatagc aaggcaggga 
ccagaggaat cacaggcgcc tggagagcag ccgggcaccg cctccaggga cctgccggct 
tccctcagtc ctccaggggc ccagcactct tcctttaggc cctgtgagcg tcccttgtca 
ggatacattc tctcattttg ctgaagctga tttgattggg tgtctgtttc tcgcagccaa 
aagagctctg aatgaggaaa gtgcttctgt gctaactccc cgcgtctcct gaatttcagt 
cattcatgta cccgcctcga aatttttgca atatctgtgt accaactgtc catttactta 
ataaagaagt tttctttaaa tt 5 62 

<210> 334 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> Contig50728 
<400> 334 

tttgattggg tgtctgtttc tcgcagccaa aagagctctg aatgaggaaa gtgcttctgt 6 

<210> 335 
<211> 400 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> AI497657 
<400> 335 

tttttttttt tgcacttatg gtatttattg ttggaagatt gagtacctta atgcacacca 
atgctcagat gacttggggg cacatagggg actgctgtca ccatgcctca ctcctgcagg 
gaaggggctg ccctactaaa accccagcgg gcccagtgct gtgtccagaa caggtcctta 
tattactgca gcccacaatg gaactactga gtaggagcca aaagaggagg gagcaggaag 
aggtggcatt tggagagggg agaccgcacc cacaggtctg ccacagcgcg tcaacggtat 
ggggtacttt tacagtcaag ttgacttcgg tgtccgccca ccatctacct ttgtaggacc 
actgaaacaa gggacatcca ccacggccca cagccggggc 400 

<210> 336 
<211> 60 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> AI497657 
<400> 336 

gagcattggt gtgcattaag gtactcaatc ttccaacaat aaataccata agtgcaaaaa 

<210> 337 
<211> 475 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> Contig50950 
<400> 337 

ctggaagagg ctcccaaccc agagtgtccc tgtgggaggc aggcagaagg tgacaattga 
cacgatttcc tgcacgcgtc ctcctctacc ttggaagcag ttagaatcta ccaggcacag 
atgaggccgc ccttgcctga cggagcttga tgagcagccc ttggtctccg gttccaggac 
tgagagccca gctgcctctg cccacccttc cccaggcctc tgccagcctc tggctgcacg 
gtcaggccct gccccatggc aggcctgcca gagcttggct ggggacccct cccgcctctg 
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gctccctgat gggctggatg taacttgtgt cttctagccc cttaaggagc ccaggtgttt 3 60 
taaggaatga attggtcact gcatcttgta tcgattatgg ttctgagaaa agcaaatatc 42 0 
acttttggct gcattaaaag aagcatcata tataaaataa agaagatgaa ggtct 475 

<210> 338 
<211> 60 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> Contig50950 
<400> 338 

gtcactgcat cttgtatcga ttatggttct gagaaaagca aatatcactt ttggctgcat 

<210> 339 
<211> 860 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> Contig51660 
<400> 339 

ggatggcaac cttcagctag actgcctggc tcaagggtgg aagcaatacc aacagagagc 
atttggctgg ttccggtgtt cctcctgcca gcgaagttgg gcttccgcca agtgcagatt 
ctgtgccaca cgtactggga gcactggaca tcccagggtc aggtgcgtat gaggctcttt 
ggccaaaggt gccagaagtg ctcctggtcc caatatgaga tgcctgagtt ctcctcggat 
agcaccatga ggattctgag caacctggtg cagcatatac tgaagaaata ctatggaaat 
ggcatgagga agtctccaga aatgccagta atcctggaag tgtccctgga aggatcccat 
gacacagcca attgtgaggc atgcactttg ggcatatgtg gacagggctt aaaaagctac 
atgacaaagc cgtccaaatc cctactcccc cacctaaaga ctgggaattc ctcacctgga 
attggtgctg tgtacctcgc aaaccaagcc aagaaccagt cagatgaggc aaaagaggct 
aaggggagtg ggtatgagaa attagggccc agtcgagacc cagatccact gaacatctgt 
gtctttattt tgctgcttgt atttattgta gtcaaatgct ttacatcaga atgatgaaaa 
taggcttgcc actttctctt attttaattc catggtagtc aatgaactgg ctgccacttt 
aatataactg aaaattcatt ttgagaccaa gcaggatcaa gtttgtagaa taaacactgg 
tttcctagcc atcctctgaa aacagtatga aacatgacca agtacataat ggatttagta 
ataaatattg tcgaattgct 860 

<210> 340 
<211> 60 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> Contig51660 
<400> 340 

gctgcttgta tttattgtag tcaaatgctt tacatcagaa tgatgaaaat aggcttgcca 60 

<210> 341 
<211> 608 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> Contig52490 
<400> 341 

atcgtggcta gcggacagac acgagcctct tgggaatacc ttgtccatca cgtcatggcc 60 
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atgggtgcct tcttctccgg catcttttgg agcagctttg tcggtggggg tgtcttaaca 12 0 

ctactggtgg aagtcagcaa catcttcctc accattcgca tgatgatgaa aatcagtaat 180 

gcccaggatc atctcctcta ccgggttaac aagtatgtga acctggtcat gtactttctc 240 

ttccgcctgg cccctcaggc ctacctcacc catttcttct tgcgttatgt gaaccagagg 3 00 

accctgggca ccttcctgct gggtatcctg ctcatgctgg acgtgatgat cataatctac 360 

ttttcccgcc tcctccgctc tgacttctgc cctgagcatg tccccaagaa gcaacacaaa 42 0 

gacaagttct tgactgagaa ctgagtgagg ggcacagagc ctgggacaac aaaaacggac 480 

aaggccagaa acagcttcat atggacactg ggacttagcc ccaagcctgg gtgtcctctg 540 

aggccagcct ctccaccttc tgagcctgcg cccacactat tgaaaacact aatgaaagta 600 
ctcctctg 608 

<210> 342 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<3 00> 

<308> Contig52490 
<400> 342 

ccaggatcat ctcctctacc gggttaacaa gtatgtgaac ctggtcatgt actttctctt 60 

<210> 343 

<211> 1282 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> Contig53598 
<400> 343 

catgccagca cctttgaacc ggtctcttag aagaagacac acatcctggg tgtacagtgg 60 

tgaaatgggg agtgggtgcc cattctgaaa aacgaggcat tcctgctcat tccctctgct 12 0 

tagctggtgg gcaggggaga gagggaaatg ccaaaaactt ggagtgaagg atgatgctat 180 

tttttatttt taaatatatc ttcaggttat tttcttactg ttgcttcaga tctaatgtaa 240 

aaggcagatg tcccctcctc tccacccccg acgctgaccc cggcctcagt cacggctctt 300 

tgcatgatca cagttctgtg ttctggcctg tggcagggcc gggaagggcc gctggcttcc 3 60 

gaacagacgt ggttgctctc cacgaggcgc atggggagcc cgcgggccct aagctttgtc 42 0 

gcagatgtca tcattggcag aattacttgt cttgaaaaat aagtagcatt gctgaaacac 4 80 

acaaccgaat tctctacgat ggccatttgc tcattgtctt tcctctgtgt gtagtgagtg 540 

accctggcag tgtttgcctg ctcagagtgg cccctcagaa caacagggct ggccttggaa 6 00 

aaaccccaaa acaggactgt ggtgacaact ctggtcaggt gtgatttgac atgagggccg 660 

gaggcggttg ctgacggcag gactggagag gctgcgtgcc cggcactggc agcgaggctc 72 0 

gtgtgtcccc caggcagatc tgggcacttt cccaacccag gtttatgcgt ctccagggaa 7 80 

gcctcggtgc cagagtggtg ggcagatctg accatcccca cagaccagaa acaaggaatt 840 

tctgggatta cccagtcccc cttcaaccca gttgatgtaa ccacctcatt ttttacaaat 9 00 

acagaatcta ttctactcag gctatgggcc tcgtcctcac tcagttattg cgagtgttgc 96 0 

tgtccgcatg ctccgggccc cacgtggctc ctgtgctcta gatcatggtg actcccccgc 1020 

cctgtggttg gaatcgatgc cacggattgc aggccaaatt tcagatcgtg tttccaaaca 10 80 

cccttgctgt gccctttaafc gggattgaaa gcacttttac cacatggaga aatatatttt 1140 

taatttgtga tgcttttcta caaggtccac tatttctgag tttaatgtgt ttccaacact 12 00 

taaggagact ctaatgaaag ctgatgaatt ttcttttctg tccaaacaag taaaataaaa 12 60 
ataaaagtct atttagatgt tg 12 82 

<210> 344 
<211> 60 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> Contig53598 
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<400> 344 

ccactatttc tgagtttaat gtgtttccaa cacttaagga gactctaatg aaagctgatg 60 

<210> 345 
<211> 601 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> Contig53641 
<400> 345 

tggaggctgt ggatgatgct ttcaagacaa tggatgtgga tatggccgag gaacatgcca 6 0 

gggcccagat gagggcccag atgaatatcg gggatgaagc gctgattgga cggtggagct 120 

gggatgacat acaagtcgag ctcctgacct gggatgagga cggagatttt ggcgatgcct 180 

gggccaggat cccctttgct ttctgggcca gataccatca gtacattctg aatagcaacc 240 

gtgccaacag gagggccacg tggagagctg gcgtcagcag tggcaccaat ggaggggcca 3 00 

gcaccagcgt cctagatggc cccagcacca gctccaccat ccggaccaga aatgctgcca 3 60 

gagctggcgc cagcttcttc tcctggatcc agcaccgttg acgaactgca gcgatcttac 42 0 

tggccaagcc agagcgcctc ctctcagatt ccttctcgac acagcaccct aggcggcttc 480 

ttcctgtcag tcggaggtgg catgcaagat gaagctctct ttgctcttcc tgctttcatt 540 

ttgtgctttt ccttgtgttt tcatgttttg ggtatcagtg ttacattaaa gttgcaaaat 600 
t 601 

<210> 346 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> Contig53641 
<400> 346 

ctttcatttt gtgcttttcc ttgtgttttc atgttttggg tatcagtgtt acattaaagt 60 

<210> 347 
<211> 751 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> Contig54242 
<400> 347 

aattactcaa agaaggagcc atttcagtta actcaagtga atgaaagact tttggaatct 6 0 
gcagtgggtc cttccctgtt gaccatttgg taacttgtaa tctgaccaaa aactcttgag 12 0 
ctgcaacagg ccttgccaga gggctcagga tgggaaagga agaaggggat aggaaaagaa 180 
gaggtaattt tacatttccc ctttaaagta aattttagcc aactcatcat tctgaaatgt 240 
ccctataaag aatgagtcga actagaccag aagccagcct actccttctt acatagcttc 3 00 
tccaacaggg gtagcaatga cctgtccact tcaaacacag ataaggcctg ccatcctcat 3 60 
tggttaaagg cacacgtgag actttcagtg ggctctgctg agaaggaagg cagcccagga 42 0 
gtcaggtatg caggcattgc attgtcagtg tctgctctca gagtttacac attcaattgc 480 
ttccaagggt gaatctcctg ctctgtgaat gctatcagac cccaaaggcc aaccttgggc 540 
tgggtctatg tacgttcttc cgaagcactg atgatcaaaa ttgaagacac attcagaggt 600 
ttgattggtt gagattaact ggtgtggtgg ttggtgtatg tatgttttat ttttatgtct 660 
ttgtatgtag ttctacataa tgcaaattgt gctttctgat ggacaagacc tcataactgt 720 
gattaatatc aataaaaagg ggatgttgtg g 7 51 

<210> 348 
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<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> Contig54242 
<400> 348 

gtaaatttta gccaactcat cattctgaaa tgtccctata aagaatgagt cgaactagac 60 

<210> 349 
<211> 637 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> Contig54661 
<400> 349 

ggcagtgatg tctatgttga gattaactta tgtattgagg aaaatttgaa gtttattttt 60 

tcgatgaata aggctgtcaa atgatttagt atagattaat gacatctttt ttagaaatat 12 0 

taaagtgagt attcctcatt atgtcatcat ttctgataat tagagtgcta atttgaatgt 180 

tagataatgt ttccacatct atacctattt ctttctaggg cacttctgac cctggggctt 2 40 

ggggatggcc tttaggccac agtagtgtct gtgttaagtt cactaaatgt gtatttaatg 3 00 

agaaacattc ctatgtaaaa atgtgtgtat gtgaacgtat gcatacattt ttattgtgca 360 

cctgtacatt gtgaagaagt agtttggaaa tttgtaaagc acaaaccata aaagagtgtg 42 0 

gagttattaa atgatgtagc acaaatgtaa tgtttagctt ataaaaggtc ctttctattt 480 

tctatggcaa agactttgac acttgaaaaa taaaaccaat atttgattta tttttgtaag 54 0 

tatttaggat attattttaa ataaatgatt gtccattatc aatataatag ttgtgaaatg 600 
atttaagtaa ataaacttta tgcttctgtg tctgttg 637 

<210> 350 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> Contig54661 
<400> 350 

ctgtacattg tgaagaagta gtttggaaat ttgtaaagca caaaccataa aagagtgtgg 60 

<210> 351 
<211> 924 
<212> DNA 

<213> Homo sapiens 

<300> 

<308> Contig55188 
<400> 351 

gcgacaagta ccgcaagcgg gcactcatcc tggtgtcact gctggccttt gccggcctct 60 

tcgccgccct cgtgctgtgg ctgtacatct accccattaa ctggccctgg atcgagcacc 120 

tcacctgctt ccccttcacc agccgcttct gcgagaagta tgagctggac caggtgctgc 180 

actgaccgct gggccacacg gctgcccctc agccctgctg gaacagggtc tgcctgcgag 240 

ggctgccctc tgcagagcgc tctctgtgtg ccagagagcc agagacccaa gacagggccc 3 00 

gggctctgga cctgggtgcc cccctgccag gcgaggctga ctccgcgtga gatggttggt 3 60 

taaggcgggg tttttctggg gcgtgaggcc tgtgagatcc tgacccaagc tcaggcacac 420 

ccaaggcacc tgcctctctg agtcttgggt ctcagttcct aatatcccgc tccttgctga 480 

gaccatctcc tggggcaggg tccttttctt cccaggtcct cagcgctgcc tctgctggtg 540 
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ccttctcccc cactactact ggagcgtgcc cttgctgggg acgtggctgt gccctcagtt 600 



660 



60 



gcccccaggg ctgggtgccc accatgcccc ttcctctttc tcctcctacc tctgccctgt 

gagcccatcc ataaggctct cagatgggac attgtgggaa aggctttggc catggtctgg 720 

gggcagagaa caagggggga gacacaagta gacctcaggt agaacgacac tgggcggagc 780 

caccccaggg cctgctccca gggagtgctc gaggcgcatc aggcccgttt tttaccagtt 840 

fcatatcacgg tcttcatttt taaaagtaac gctaactttg tacggacgat gtctcatgga 900 
ttaaataata ttctttatgg cagt 924 

<210> 352 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> Contig55188 
<400> 352 

agtaacgcta actttgtacg gacgatgtct catggattaa ataatattct ttatggcagt 

<210> 353 
<211> 699 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> Contig55353 
<400> 353 

tgattatgcc aagagctcta aacagaagtt tgagaaggta aaaattaagt tgtagtatct 60 

gagttgtttt tattttcttc ctttggtgtt tatgaaggta ttcataagaa ctttaatttc 12 0 

aggggaaaaa atgcctgatt tgctattttt gacatttcct cgtctcttaa gaagtcagtt 180 

aaatatgttt tcatagttta tattcctgtt tcatagatta ctgtgaaaca tgtatttaaa 240 

cc tatgaatt ataaaatagt atttagattc tagcgtgagt taaatagatt agtcatatat 3 00 

cttttagatt tgtggatttg acatgtaaat tatgtgttgt gtataagtaa gttagttact 360 
aaacatatgg catggttatt gataaacttg ttgctatttt tttccaaatg ctatcagtgt 420 

ttgtggactt ttaaaaatta gtttgaattt tggaatgttc tgtgataaaa tataatttca 



480 



actattttgt acatttaaat atgccatgtt gtatatgtct gtatttaaaa atgttgtaaa 540 



600 



tatctgcatt ttaagaatta tgaaagattt tcctcaaaaa tgacagaact ctccatactt 
aattgtgaca cattataaga tatctgattt taagcttttg gattttgttc taaaaattaa 660 
gtttaaacat gctgaaaatt ccataaaaat aaaattttg 699 

<210> 354 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> Contig55353 
<400> 354 

taaaatagta tttagattct agcgtgagtt aaatagatta gtcatatatc ttttagattt 

<210> 355 

<211> 809 

<212> DNA 

<213> Homo sapiens 



60 



<300> 

<308> Contig56503 
<400> 355 
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gcatgtgaga tgagtgactg ccggtgaatg tgtccacagt tgagaggttg gagcaggatg 60 

agggaatcct gtcaccatca ataatcactt gtggagcgcc actctgccca agacgccacc 12 0 

tgggcggaca gcatggagct ctccatggcc aggctgcctg tgtgcatgtt ccctgtctgg 180 

tgcccctttg cccgcctcct gcaaacctca cagggtcccc acacaacagt gccctccaga 240 

agcagcccct cggaggcaga ggaaggaaaa tggggatggc tggggctctc tccatcctcc 300 

ttttctcctt gccttcgcat ggctggcctt cccctccaaa acctccattc ccctgctgcc 360 

agcccctttg ccatagcctg attttgggga ggaggaaggg gcgatttgag ggagaagggg 42 0 

agaaagctta tggctgggtc tggtttcttc ccttcccaga gggtcttact gttccagggt 480 

ggccccaggg caggcagggg ccacactatg cctgcgccct ggtaaaggtg acccctgcca 540 

tttaccagca gccctggcat gttcctgccc cacaggaata gaatggaggg agctccagaa 600 

actttccatc ccaaaggcag tctccgtggt tgaagcagac tggatttttg ctctgcccct 6 60 

gaccccttgt ccctctttga gggaggggag ctatgctagg actccaacct cagggactcg 72 0 

ggtggcctgc gctagcttct tttgatactg aaaactttta aggtgggagg gtggcaaggg 7 80 
atgtgcttaa taaatcaatt ccaagcctc 809 

<210> 356 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> Contig56503 
<400> 356 

gaaaactttt aaggtgggag ggtggcaagg gatgtgctta ataaatcaat tccaagcctc 60 

<210> 357 
<211> 976 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> Contig56678 
<400> 357 

gaaggatata ctttgttata acttattatt ttgttctctg taaatacaag atgtttatag 60 

gaaatatgta ttctgaactc tatctgcaga atgagtcact acaccaaaat agttctatta 12 0 

tttagaatgt gttaatttta aagggacctg ataggtattt atttacatat gcgatccaca 180 

tttgtgtgaa agcatgtgat catactaacc cagcctcctg gaatgtcgct gtacgatgat 240 

tgatgtcttt ttctcagtcc atagttacaa ttgtttagta tgctaatcag tccagttccc 3 00 

tgaggtttaa gatcaaatat aaattactct gcttttcgac tcattcaggt agcattgtac 3 60 

ctgaacctga ttgctacttt ttcatcttaa atattatatt tcctcatcta atctgccttc 42 0 

ccctcatcca cagacatttg gagaaggaaa tgggagggtg tctgttatcc ctttctcttt 4 80 

gctttgtccc cgttgttaga ctggcagcgt cagttgctcg gtgggcttgg ttagagccgt 540 

gggtgaggca ggtggctggc ggggacaggg agaggctgag agggaagtgg tggcatttac 600 

tgctctgaca cttccactgt ccctgctggg gatgctgggg ccaaggcctg tggggcctgt 660 

gaactgcaca gccaggagca aggaacccac taaatactcc gtcacctcca tgtcccctct 72 0 

acagtgttaa attattacat aagcaggtga aaggtagaag gcgaattatg tgagtaaata 7 80 

tggtctgttt tctcttcagc aaaaatgact atttttgtgt gtgactaatt tatttttatt 840 

attgtaaaga tacaataaac cggttgaaat atctgctttg ttgacaagcg tgtgctttct 900 

ctggccttat tcgcgttctg ttctcctgca aatagcgccc tctaaaaaga agagtcagac 960 
aataaactgg ttgaaa 976 

<210> 358 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> Contig56678 
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660 
720 
780 
840 
900 



<400> 358 

tattacataa gcaggtgaaa ggtagaaggc gaattatgtg agtaaatatg gtctgttttc 6 0 

<210> 359 

<211> 1118 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> Contig57584 
<400> 359 

agctgttgtg catccagagg tggaattggg gcccggcatt ccctcctcgt cccgggctgg 60 

cccttgcccc caccctgcaa ctcctggttg agatgggctc agccaagagc gtcccagtca 12 0 

caccagcgcg gcctccgccg acaacaagca tctggctcga gtggcggacc cccgttcacc 180 

tagtgctggc atcctgcgca ctcccatcca ggtggagagc tctccacagc caggcctacc 240 

agcaggggag caactggagg gtcttaaaca tgcccaggac tcagatcccc gctctcctac 3 00 

tcttggtatt gcacggacac ctatgaagac cagcagtgga gaccccccaa gcccactggt 3 60 

gaaacagctg agtgaagtat ttgaaactga agactctaaa tcaaatcttc ccccagagcc 42 0 
tgttctgccc ccagaggcac ctttatcttc tgaattggac ttgcctctgg gtacccagtt 480 
atctgttgag gaacagatgc caccttggaa ccagactgag ttcccctcca aacaggtgtt 540 
ttccaaggag gaagcaagac agcccacaga aacccctgtg gccagccaga gctccgacaa 600 
gccctcaagg gaccctgaga ctcccagatc ttcaggttct atgcgcaata gatggaaacc 
aaacagcagc aaggtactag ggagatcccc cctcaccatc ctgcaggatg acaactcccc 
tggcaccctg acactacgac agggtaagcg gccttcaccc ctaagtgaaa atgttagtga 
actaaaggaa ggagccattc ttggaactgg acgacttctg aaaactggag gacgagcatg 
ggagcaaggc caggaccatg acaaggaaaa tcagcacttt cccttggtgg agagctaggc 
cctgcatggc cccagcaatg cagtcaccca gggcctggtg atatctgtgt cctctcaccc 9 60 
cttctttccc agggatactg aggaatggct tgttttctta gactcctcct cagctaccaa 1020 
actgggactc acagctttat tgggctttct ttgtgtcttg tgtgtttctt ttatattaaa 1080 
ggaagtaatt ttaaatgtta ctttaaaaag gtatatgt 1118 

<210> 360 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> Contig57584 
<400> 360 

aggaatggct tgttttctta gactcctcct cagctaccaa actgggactc acagctttat 

<210> 361 
<211> 859 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> Contig63649 

<400> 361 _ n 

gtcgcagggt accagtgtgc ggagttcctg ttgccaagct gaaggtggcc ctgggcaggc 60 

acaggtgtgg tcatatcttc agccaacagg accatcctcc ggagggccac ctctggggac 120 

ttcctacggg aagagagtga cagatttggt gcttctgtgt gtttctgccg cttcagtggg 180 

gccgctgcgg gagacagcgg gtggatcctc cagcagcctg tctgctgagc ctgccttctc 240 

aagtctactg ttaaaatcag gaccgggtcg tgtccgagcc tacaggccct gtctccgctc 3 00 

cccaggcctg caggagttga gggctgcacc tgctcgctgg agagggagag gcagatttag 360 

tggacgcctg gcatggactc ggactggcct ttggaagctc cctgccctga cgggttgcct 42 0 

gtcaccactg cgaagtgagg cttggaggac ctgcacctga gaaaggctgt gtgtggtctt 480 



60 
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gggtccacac ctgccagagc taacttactg ccagacggcg acttactgtg ggccaccctc 540 

agtgaaccgg ggtgtcctca gctggcccta cagagcactt ctgtgctggg gatgagtagg 600 

aactctgggc gaggagggtc ccagcgccgc ccctcgatac agccctgctc tgccctctgc 66 0 

ccgtacttat accaggtggg atccctgccc tgcattgcct ggggattggc tgggcttggg 72 0 

cacgccctgc tgtggaactg gatgttttca gggagcccag cctttcctca tgtcaacaca 780 

gttcacaata tagttttcaa agtacagttt aaaactcaaa agtaaacttt tcagcaactc 840 
aaaaaaaaaa aaaaaaaaa 859 

<210> 362 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> Contig63649 
<400> 362 

cagcctttcc tcatgtcaac acagttcaca atatagtttt caaagtacag tttaaaactc 6 0 

<210> 363 

<211> 1170 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> Contig63525 
<400> 363 

gccatggctc cctgggcgga gcgagcactc gcggctgaac ccgctgcgcg cggtgtggct 60 

cacgctgacc gccgccttcc tgctgaccct actgctgcag ctcctgccgc ccggcctgct 120 

cccgggctgc gcgatcttcc aggacctgat ccgctatggg aaaaccaagt gtggggagcc 180 

gtcgcgcccc gccgcctgcc gagcctttga tgtccccaag agatattttt cccactttta 240 

tatcatctca gtgctgtgga atggcttcct gctttggtgc cttactcaat ctctgttcct 300 

gggagcacct tttccaagct ggcttcatgg tttgctcaga attctcgggg cggcacagtt 360 

ccagggaggg gagctggcac tgtctgcatt cttagtgcta gtatttctgt ggctgcacag 42 0 

cttacgaaga ctcttcgagt gcctctacgt cagtgtcttc tccaatgtca tgattcacgt 480 

cgtgcagtac tgttttggac ttgtctatta tgtccttgtt ggcctaactg tgctgagcca 540 

agtgccaatg gatggcagga atgctacata acagggaaaa atctattgat gcaagcacgg 600 

tggttccata ttcttgggat gatgatgttc atctggtcat ctgcccatca gtataagtgc 660 

catgttattc tcggcaatct caggaaaaat aaagcaggag tggtcattca ctgtaaccac 72 0 

aggatcccat ttggagactg gtttgaatat gtttcttccc ctaactactt agcagagctg 7 80 

atgatctacg tttccatggc cgtcaccttt gggttccaca acttaacttg gtggctagtg 840 

gtgacaaatg tcttctttaa tcaggccctg tctgcctttc tcagccacca attctacaaa 900 

agcaaatttg tctcttaccc gaagcatagg aaagctttcc taccattttt gttttaagtt 960 

aacctcagtc atgaagaatg caaaccaggt gatggtttca atgcctaagg acagtgaagt 102 0 

ctggagccca aagtacagtt tcagcaaagc tgtttgaaac tctccattcc atttctatac 1080 

cccacaagtt ttcactgaat gagcatgcag tgccactcaa gaaaatgaat ctccaaagta 1140 
tcttcaaaga attaattact aatggcagat 1170 

<210> 364 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> Contig63525 
<400> 364 

ctcttacccg aagcatagga aagctttcct accatttttg ttttaagtta acctcagtca 60 
<210> 365 
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<211> 632 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> Contig64688 
<400> 365 

aagaatgcta agatgatttc agatatcgaa aagaaaaggc agcgtatgat tgaagtccag 60 

gatgaactgc ttcggttaga gccacagctg aaacaactac aaacaaaata tgatgaactt 12 0 

aaagagagaa agtcttccct taggaatgca gcatatttct tatctaattt aaaacagctt 180 

tatcaagatt attcagatgt tcaagctcaa gaaccaaacg taaaggaaac gtatgattca 240 

tccagccttc cagctctgtt atttaaagca agaacacttc tgggagccga aagccatctg 3 00 

cgaaatatca accatcagtt agagaagctc cttgaccagg gatgagaaga gcagtctact 3 60 

aaaatgtgcc tataggaaga ctagtctcat gctgttacct tctgaaactg tacctttata 420 

aatcaattgt tttgcaaaga agttatggcc tacttagaat ctaaaatttg ttattcaaat 4 80 

taaatggctg tgaacaatgt taaatagcat cagtttgtcc aatagtttta aaggccataa 540 

tcatcttttc tggttaatat cttgagtaat tttaaaatgt tgacacctta atcggtccca 600 
ggtatgagcc ataataaact tgtaaaatta ag 632 

<210> 366 

<211> 60 

<212> DNA 

<213> Homo sapiens 



<300> 

<308> Contig64688 
<400> 366 

ggctgtgaac aatgttaaat agcatcagtt 



tgtccaatag ttttaaaggc cataatcatc 60 
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